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PREFACE 


On behalf of the Conference Technical Committee, I would like to personally welcome 
all of you, especially those from overseas, to the 16 th Annual Review of Progress in 
Applied Computational Electromagnetics - the first ACES Conference of the new 
millennium, here at the Naval Postgraduate School in beautiful Monterey. 

We are fortunate to have with us this year three highly-regarded plenary speakers, Raj 
Mittra, Tom Cwik and Weng Chew, who will be discussing the challenges in 
computational electromagnetics that await as we move into the new millennium. The 
technical sessions this year cover a wide variety of topics ranging from Wavelet and TLM 
Modeling Techniques to Virtual Reality in Real-World Applications. Of special 
significance this year are the good number of international participants, some of whom 
have organized sessions and will also be chairing and presenting papers in the 
conference. This will also be the first year that there will be a special session devoted to 
the Student Paper Competition. We hope that you will all take advantage of the 
interactive poster session, vendor exhibits and the excellent program of short courses that 
have been organized for this years’ conference. 

We would also like to take this opportunity to thank the Naval Postgraduate School for 
serving as the host for the ACES 2000 conference. Special thanks goes to Dick and Pat 
Adler for their unending dedication and tireless efforts on behalf of ACES. We want to 
specifically thank all those who have helped to make ACES 2000 possible by organizing 
and chairing sessions. Finally, I would like to extend my personal thanks to my technical 
program committee team members — including co-chairs Randy Haupt and Ping Wemer, 
vendor chair Leo Kempel, short course chair Susan Hagness, publicity chair Keith 
Lysiak, and ACES webmaster Etef Elsherbeni — for all their hard work and dedication 
throughout the past year. 

We hope that the conference will provide a unique opportunity for all of you to come 
together from around the world to share your knowledge in computational 
electromagnetics by renewing old acquaintances and forming new collaborations. We 
also hope that ACES 2000 will be a memorable and productive conference for each and 
every participant. 






ACES PRESIDENT’S STATEMENT 


Welcome to the ACES 2000 conference! The ACES Annua! Review of Progress in Applied Computational 
Electromagnetics now has a 16-year tradition of excellence that guarantees a pleasant and educational 
experience. CEM specialists worldwide, both members and non-members of ACES, know they can find our 
conference in Monterey during the third week of March. The Naval Postgraduate School has been a gracious 
host and afforded excellent accommodations to ACES over the years, but a school calendar change has 
brought into question the week or location for our future conferences. The ACES Board of Directors will study 
this issue carefully before recommending any change, as both our date and place have been so well known 
for so many years. 

We again express our continuing gratitude and debt to Dick and Pat Adler for their (perennial) long hours 
devoted to ACES, including the conference local arrangements. Doug Werner and his capable conference 
team have put together an excellent technical program. I won’t attempt to credit all his enthusiastic and 
capable assistants here, but please come hear Doug’s opening remarks on Tuesday morning. Because the 
majority of our conference registrants continue to prefer a limited number of parallel sessions, ACES has 
restricted the number of accepted papers for the technical program this year. I hope you will appreciate and 
enjoy this distinctive feature of our conference. The ACES 2000 conference is more than just paper sessions, 
short courses, and award banquets, and ACES is actively striving to enhance your opportunities to have some 
quality time available for colleagues and friends, both old and new. If you see a way that we can further 
improve the structure and conduct of the conference, please do let us hear from you! 

For all the news about ACES 2001 as it is becomes available, remember to visit our Web sit at 
aces.ee.olemiss.edu on a regular basis! 

Perry Wheless 
ACES President 
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ACES 2000 SHORT COURSES/WORKSHOPS 


MONDAY 20 MARCH 2000 


0830-1630 SHORT COURSE#1 (FULL-DAY) 

"Computational Electromagnetic Methods in Mobile Wireless Communication Design" 

Ray Perez, Jet Propulsion Laboratory 

0830-1630 SHORT COURSE #2 (FULL-DAY) 

"XML and Modern Internet Technologies for Scientific Applications” 

Furrukh S. Kahn, Ohio State University. 

0830-1630 SHORT COURSE#3 (FULL-DAY) 

"The Basics of The Finite Difference Time Domain Technique for Electromagnetic Application" 
Atef Z. Elsherbeni and Allen W. Glisson, University of Mississippi 

0830-1630 SHORT COURSE #4 (FULL-DAY) 

"Techniques for Electromagnetic Visualization" 

Edmund K. Miller, Santa Fe, NM, and John Shaeffer, Marietta Scientific, Inc. 

0830-1630 SHORT COURSE#5 (FULL-DAY) 

"EIGER - Electromagnetic Interactions Generalized: An Introduction to and Tutorial on the 
Software Suite" 

Robert M. Sharpe and Nathan J. Champagne, Lawrence Livermore National Laboratory 
William A. Johnson, Sandia National Laboratories, Donald R. Wilton, University of Houston, and J. 
Brian Grant, ANT-S. 

0830-1130 HANDS-ON-WORKSHOP#6 (HALF-DAY, MORNING) 

"MATHCAD BASICS", Jovan Lebaric, Naval Postgraduate School. 

1330-1630 HANDS-ON-WORKSHOP #7 (HALF-DAY, AFTERNOON) 

"MATLAB BASICS", Jovan Lebaric, Naval Postgraduate School. 

0830-1630 SHORT COURSE #8 (FULL-DAY) 

"EMI/EMC Computational Modeling for Real-World Engineering Problems"" 

Omar Ramahi, Compaq Corporation, and Bruce Archambeault, IBM 


PLEASE NOTE THAT A 10% DISCOUNT IS IN EFFECT FOR ALL WORKSHOPS TAKEN AFTER ATTENDING AN INITIAL WORKSHOP. 
THIS APPLIES TO THE GROUP OF WORKSHOPS FOR MATLAB AND MATHCAD . 


FRIDAY 24 MARCH 2000 


0830-1630 SHORT COURSE #9 (FULL-DAY) 

“Why is There Electromagnetic Radiation and Where Does It Come From?” 

John Shaeffer, Marietta Scientific Inc., and Edmund K. Miller, Santa Fe, NM 

SHORT COURSE #10 (FULL-DAY) 

“Recent Advances in Fast Algorithms for Computational Electromagnetics” 

Weng Cho Chew, Jianming Jin, Eric Michielssen, and Jiming Song, University of Illinois at 
Urbana Champaign 


0830-1630 






ACES 2000 SHORTCOURSES/WORKSHOPS (cont) 


FRIDAY 24 MARCH 2000 


0830-1630 HANDS-ON-WORKSHOP #11 (FULL-DAY) 

“Method of Moments (MoM) Using MATHCAD” 

Jovan Lebaric, Naval Postgraduate School 

0830-1130 SHORT COURSE#12 (HALF-DAY, MORNING) 

“Computational Electromagnetics Using Beowulf-Cluster Computers” 
Tom Cwikand Daniel S. Katz, Jet Propulsion Laboratory 

1330-1630 SHORT COURSE #13 (HALF-DAY AFTERNOON) 

“Multiresolution FEM: Introduction and Antenna Applications” 

John L Volakis, University of Michigan and Lars Anderson, Agilent 

SATURDAY 25 MARCH 2000 


0830-1630 HANDS-ON-WORKSHOP #14 (FULL-DAY) 

"FD/FDTD Using MATLAB” 

Jovan Lebaric, Naval Postgraduate School, Monterey, CA 

0830-1630 SHORT COURSE #15 (FULL-DAY) 

"An Introduction to Radar Cross Section” 

John Shaeffer, Marietta Scientific Inc. 
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AGENDA 

The Sixteenth Annual Review of Progress in Applied Computational Electromagnetics 

NAVAL POSTGRADUATE SCHOOL, MONTEREY, CALIFORNIA, U.S.A. 

20-25 March 2000 

Douglas Werner, Technical Program Chair 
Randy Haupt, Symposium Co-Chair 
Pingjuan Werner, Symposium Co-Chair 
Susan Hagness, Short Course Chair 
Leo Kempel, Vendor Chair 
Keith Lysiak, Publicity Chair 
Richard Adler, Symposium Administrator 
Pat Adler, Conference Secretary 

MONDAY MORNING 20 MARCH 2000 

0700 - 0730 CONTINENTAL BREAKFAST Glasgow Courtyard 

(For short course and hands-on-workshop attendees only) 

0730-0820 SHORT COURSE/HANDS-ON-WORKSHOP REGISTRATION Glasgow 103 

0830-1630 SHORT COURSE #1 (FULL-DAY) Ingersoll 280 

"Computational Electromagnetic Methods in Mobile Wireless Communication Design” 

Ray Perez, Jet Propulsion Laboratory 

0830-1630 SHORT COURSE #2 (FULL-DAY) Glasgow 102 

“XML and Modem Internet Technologies for Scientific Applications” 

Furrukh S. Khan, Ohio State University 

0830-1630 SHORT COURSE #3 (FULL-DAY) Ingersoll 122 

“The Basics of The Finite Difference Time Domain Technique for Electromagnetic Application” 

Atef Z. Elsherbeni and Allen W. Glisson, University of Mississippi 

0830-1630 SHORT COURSE #4 (FULL-DAY) Engr Auditorium 

“Techniques for Electromagnetic Visualization” 

Edmund K. Miller, Santa Fe, NM, and John Shaeffer, Marietta Scientific, Inc 

0830-1630 SHORT COURSE #5 (FULL-DAY) Ingersoll 265 

“EIGER - Electromagnetic Interactions Generalized: An Introduction to and Tutorial on the Software Suite" 

Robert M. Sharpe and Nathan J. Champagne, Lawrence Livermore National Laboratory 
William A. Johnson, Sandia National Laboratories, Donald R. Wilton, University of Houston, 

And J. Brian Grant, ANT-S 

0830-1130 HANDS-ON-WORKSHOP #6 (HALF-DAY, MORNING) 

“MATHCAD Basics” 

Jovan Lebaric, Naval Postgraduate School 

0830-1630 SHORT COURSE #8 (FULL-DAY) Spanagel 101A 

“EMI/EMC Computational Modeling for Real-World Engineering Problems” 

Omar Ramahi, Compaq Corporation, and Bruce Archambeault, IBM 

0900-1200 CONFERENCE REGISTRATION Glasgow 103 

MONDAY AFTERNOON 

1330-1630 HANDS-ON-WORKS HOP #7 (HALF-DAY, AFTERNOON) 

“MATLAB Basics” 

Jovan Lebaric, Naval Postgraduate School 

1400-1800 CONFERENCE REGISTRATION Glasgow 103 

1700 BOD MEETING SP101A 

1900 PUBLICATION DINNER 
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TUESDAY MORNING 21 MARCH 2000 


0715 - 0745 CONTINENTAL BREAKFAST Glasgow Courtyard 

0740 ACES BUSINESS MEETING President Perry Wheless Glasgow 102 

0745 ACES Website Demo Atef Elsherbeni Glasgow 102 

0800 WELCOME Douglas Werner, Penn State Univeristy Glasgow 102 

0815 PLENARY SPEAKER: Raj Mittra, Penn State University Glasgow 102 

"Efficient Extraction of S-parameters of Transmission Line Discontinuities for RF and Wireless Circuit Design” 

SESSION 1: FINITE ELEMENT METHODS Parallel with Sessions 2, 3 & 4) 

Chairs: Jianming Jin and Peter Monk 


0920 

•Transient Electromagnetic Scattering from Cun/ed Dielectric/Lossy 

3D Bodies Using Covariant Projection Elements" 

R. Ordovas, S.P. Walker & M.J. Bluck 

0940 

Towards an hp-Adaptive Finite Element Method for Full-Wave 

Analysis of Waveguides” 

L. Vardapetyan & L. Demkowicz 

1000 

'An hp -Adaptive Finite Element Method for Maxwell’s Equations: 

A Progress Report” 

L. Demkowicz 

1020 

BREAK 


1040 

“Finite Element Method for Designing Plasma Reactors” 

Leo Kempel, Paul Rummel, Tim Grotjohn & John Amrhein 

1100 

“Finite-Element Domain Decomposition Through an Iterative 

Algorithm: Coupling Between Cavity-Backed Slots” 

Anastasis C. Polycarpou & Constantine A. Balanis 

1120 

“Investigation of the Bunting/Davis Functional when used 
with Vector Finite Elements for Waveguide Analysis” 

Andrew F. Peterson & Sharib Wasi 

1140 

“Numerical Methods for High Frequency Problems" 

T. Huttunen & P. Monk 

1200 

LUNCH 


SESSION 2: OPTIMIZATION IN ELECTROMAGNETICS 

Chairs: Eric Michielssen and Dan Weile 

(Parallel with Sessions 1, 3 & 4) 

0920 

“Design of Dual Band Frequency Selective Surfaces Using 

Genetic Algorithm" 

A. Monorchio, R. Mittra & G. Manara 

0940 

“A Study of Cauchy and Gaussian Mutation Operators in the 

Evolutionary Programming Optimization of Antenna Structures" 

Ahmad Hoorfar & Yuan Liu 

1000 

“A Statistical Intercomparison of Binary and Decimal Genetic Algorithms” 

Yee Hui Lee, Stuart J.Porter & Andrew C. Marvin 

1020 

BREAK 


1040 

The Compact Genetic Algorithm: A Litmus test for Genetic 

Algorithm Applicability" 

Daniel S. Weile, Eric Michielssen & David E. Goldberg 

1100 

“Dipole Equivalent Circuit Optimization Using Genetic Algorithm” 

Bruce Long, Ping Werner & Doug Wemer 

1120 

"Computing the Electromagnetic Field in a Perturbed Configuration 

Using Modified Reduced-Order Models" 

R.F. Remis & P.M. van den Berg 

1140 

Some Further Results From FARS: Far-Field Analysis of Radiation 

Sources" 

Edmund K. Miller 

1200 

LUNCH 


SESSION 3: NUMERICAL TECHNIQUES FOR PACKAGING AND INTERCONNECTS 
Chairs: Omar Ramahi and Andreas Cangelfaris 

(Parallel with Sessions 1,2, & 4) 

0920 

“A New Methodology for the Direct Generation of Closed-Form 

Electrostatic Green’s Functions in Layered Dielectrics" 

Andreas C. Cangellaris 

0940 

The Treatment of Narrow Microstrips and PCB Tracks in the 

FDTD Method Using Empirically Modified Coefficients" 

Chris J. Railton 

1000 

Time-Domain-Analysis of QTEM Wave Propagation and Crosstalk 
on Lossy Multiconductor Transmission Lines with Terminal Coupling” 

XX 

Georg Muller, Jan Wendel & Karl Reip 




TUESDAY MORNING 21 MARCH 2000 


SESSION 3: NUMERICAL TECHNIQUES FOR PACKAGING AND INTERCONNECTS (cont) 


1020 BREAK 

1040 “An MPIE-Based Circuit Extraction Technique and Its Applications 
on Power Bus Modeling in High-Speed Digital Designs” 

1100 “Non-resonant Electromagnetic Simulation of Some Resonant Planar 
Circuits" 

1120 “FDTD Analysis of Conventional and Novel Delay Lines" 

1140 “Complementary Operators for Frequency-Domain Method: A Single 

Simulation Implementation" 

1200 LUNCH 

SESSION 4: STUDENT PAPER COMPETITION 
Chair: Perry Wheless 

0920 "Systematic Studies in Annular Ring PBG Structures” 

0940 “Fast Electromagnetic Analysis Using the Asymptotic Waveform 

Evaluation Method” 


Jun Fan, Hao Shi, James L. Knighten 
James L. Drewniak 

Yuriy O. Shlepnev 


Omar M. Ramahi 
Omar M. Ramahi 

(Parallel with Sessions 1, 2 & 3) 

Todd Lammers, Shawn W. Staker, & Melinda Piket-May 
Dan Jiao & Jianming Jin 


1000 “A Domain-Decomposition/Reciprocity Technique for the Analysis 
Of Arbitrarily-Shaped Microstrip Antennas with Dielectric Substrates 
and Superstates Mounted on Circulariy-Cylindricai Platforms” 

1020 BREAK 

1040 “A New FDTD Scheme to Mode! Chiral Media” 

1100 T-Matrix Computer Code Applied to Electromagnetic Field 

Penetration in Magnetic Resonance Imaging” 

1200 LUNCH 

TUESDAY AFTERNOON 

1300-1530 INTERACTIVE POSTER SESSION 

1300-1900 VENDOR EXHIBITS 

1500-1700 WINE AND CHEESE TASTING 

SESSION 5: INTERACTIVE POSTER SESSION 

“Characteristics of Silicon Photoconductivity Under Near-Infrared illumination” 
"Characteristics of Fractal Antennas" 


R.J. Allard, D.H. Werner, & J.S. Zmyslo 


A. Akyurtlu, D.H, Wemer, & K. Aydin 

Rafael R. Canales, Luis F. Fonseca & Fredy R. Zypman 


Ballroom, Herrmann Hall 

Ballroom, Herrmann Hall 

Ballroom, Herrmann Hall 

Ballroom, Herrmann Hall 

Preston P. Young, Robert Magnusson 
Tim R. Holzheimer 

Haruo Kawakami, Yasushi Ojiro, Yasushi lizuka, 

Satoshi Kogiso & Gentei Sato 


“Feigenbaum Encryption of Computer Codes” 

“Extension of SuperNEC to Calculate Characteristic Modes” 

(THE PAPER “XPATCH 4..., BELOW IS BEING PRESENTED HERE 
PROCEEDINGS 

“Xpatch 4: The Next Generation in High Frequency Electromagnetic 
Modeling and Simulation Software” 


R.M. Bevensee 
Thomas Abbott 

IT WAS NOT RELEASED IN TIME FOR INCLUSION IN THE 

J. Hughes, J. Moore, S. Kosanovich, D. Kapp, 

R. Bhalla, R. Kipp, T. Courtney, A, Nolan, D. Andersh, 
F. German and J. Cook 


“Review of Basic 3D Geometry Considerations for Intelligent CEM 
Pre-Processor Applications” 

"Modelling of Loaded Wire Conductor Above Perfectly Conducting Ground 
by Using 3D TLM Method” 


KurtV. Sunderland 


Nebojsa S. Dondov, Bratislav D. Miiovanovic, 
Vladica M. Trenkic 
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TUESDAY AFTERNOON 21 MARCH 2000 


1300-1530 INTERACTIVE POSTER SESSION (cont) 

“Near to Far Reid Transformation for a FDTD BOR with PML ABC and 
Sub-Grid Capability” 

“Evanescent Tunneling and Quantile Motion of Electromagnetic Waves 
in Wave Guides of Varying Cross Section” 

“A Modal Approach for the Calculation of Scattering Parameters in Lossfree 
and Lossy Structures Using the Fl-Technique” 

“A Modular Technique for the Calculation of Wave Guide Structures” 

“Wave Propagation Through 2D Clusters of Coupled Cylindrical Resonators” 

"Design Software for Cylindrical Helix Antennas” 

The Analysis of a Center-Fed Helical Microstrip Antenna Mounted on a 
Dielectric-Coated Circular Cylinder Using the Reciprocity Theorem" 

“Near to Far Reid Transformation for a FDTD BOR with PML ABC and 
Sub-Grid Capability” 


Ballroom, Herrmann Hall 

Vicente Rodriguez-Pereyra, Atef Z. Elsherbeni, 

Charles E. Smith 

E. Gjonaj 

Rolf Schuhmann, Peter Hammes, Stefan Setzer, 

Bemd Trapp, & Thomas Weiland 

Johannes Borkes, Adalbert Beyer, & Oliver Pertz 

Ross A. Speciale 

M. Slater, C.W. Trueman 

R.A. Martin, & D.H. Wemer 

Vicente Rodriguez-Pereyra, Atef Elsherbeni & Charles Smith 


WEDNESDAY MORNING 

0715 - 0800 CONTINENTAL BREAKFAST 

0815 PLENARY SPEAKER Tom Cwik, Jet Propulsion Laboratory Glasgow 102 

“Design on Computer - A Coming of Age” 

SESSION 6: COMPUTATIONAL BIO-ELECTROMAGNETICS Parallel with Sessions 7 & 8) 

Chairs: Ray Luebbers and Susan Hagness 


0920 

“Numerical Investigation of Two Confocal Microwave Imaging 

Systems for Breast Tumor Detection” 

Susan C. Hagness, Xu Li, Elise C. Fear & Maria A. Stuchly 

0940 

“FDTD Studies on SAR in Biological Cells Exposed to 837 and 

1900 MHz inaTEM Cell” 

A.W. Guy 

1000 

“Modelling of Personnel Electromagnetic Radiation Hazards 

Deliberation of a Novice” 

Alan Nott 

1020 

BREAK 


1040 

“Modeling Interference Between Very Low Frequency 

Electromagnetic Fields and Implanted Cardiac Pacemakers” 

Trevor W. Dawson & Maria A. Stuchly 

1100 

“Using Computational Electromagnetics to Solve an Occupational 

Health and Safety Incident" 

Timothy Priest, Kevin Goldsmith & Dean DuRieu 

1120 

"Analysis of Permanent Magnet Type of MRI Taking Account of 
Hysteresis and Eddy Current and Experimental Verification" 

Norio Takahashi, Siti Zubaidah, Takeshi Kayano 

Koji Miyata & Ken Ohashi 

SESSION 7: VIRTUAL REALITY IN REAL-WORLD APPLICATIONS 

Chairs: Stan Kubina and Dennis DeCarlo 

(Parallel with Sessions 6 & 8) 

0920 

“A Virtual Radiation Pattern Range and Its Uses - 
C-130/Hercules HF Notch Antenna" 

Stanley J. Kubina, Christopher W. Trueman 

David Gaudine 

0940 

“HF Towel-Bar Antenna Location Study Aboard an H3 Sikorsky 
Helicopter” 

Saad N. Tabet, Carl D. Myers & Dennis DeCarlo 

1000 

“Improving Model Confidence through Metamorphosis" 

Douglas R. Munn and Chris Trueman 

1020 

BREAK 


1040 

“Model Morphing for Insight into the HF Assessment Parameters” 

Douglas R. Munn and Chris Trueman 

1100 

“3D Modeling of Complex Helicopter Structures: Prediction and 
Measurements” 

Anastasis C. Polycarpou, Dong-Ho Han 

Stavros V. Georgakopoulos 


& Constantine A. Balanis 
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WEDNESDAY MORNING 22 MARCH 2000 


SESSION 7: VIRTUAL REALITY IN REAL-WORLD APPLICATIONS (cont) 

1120 “Increasing the Productivity of NEC Analysis with Virtual 
Reality and 3D Laser Scanners” 

1140 “An Interactive HTML Based Multimedia Course on Antennas” 

SESSION 8: EMC 

Chairs: Bruce Archambeault and Jim Drewniak 

0920 "Adding Imperfections to EMC FDTD Models as a Means of 
Increasing Accuracy” 

0940 "Power Conversion Techniques for Portable EMI Sensitive 
Applications” 

1000 "Using the Partial Element Equivalent Circuit (PEEC) Simulation 

Technique to Properly Analyze Power/Ground Plane EMI Decoupling 
Performance” 

1020 BREAK 

1040 "EMI Model Validation and Standard Challenge Problems” 

1100 “Modeling EMI Resulting from a Signal via Transition Through 
Power/Ground Layers” 

1120 “Techniques for Optimizing FEM/MoM Codes” 

1140 “Numerical Modeling of Shielding by a Wire Mesh Box” 

LUNCH 

WEDNESDAY AFTERNOON 

SESSION 9: PROPAGATION 

Chairs: Steve Fast and Frank Ryan 

1320 “A Fast Quasi Three-Dimensional Propagation Model for Urban 

Microcells” 

1340 "FDTD Techniques for Evaluating the Accuracy of Ray-Tracing 
Propagation Models for Microcells" 

1400 “A Building Database Features Pre-Processor for 3-D SBR/GTD 
Urban EM Propagation Models” 

1420 ‘Toward a New Model for Indoor and Urban Propagation Using 
Percolation Theory” 

1440 “Ray Tracing Algorithm for Indoor Propagation" 

1500 BREAK 

1520 “Modeling Large and Small-Scale Fading on the DPSK Datalink 
Channel Using a GTD Ray-Tracing Model" 

1540 “Rough Surface Forward Scatter in the Parabolic Wave Equation Model” 

1600 “A Comparison of Electromagnetic Parabolic Equation Propagation 
Models Used by the U.S. Navy to Predict Radar Performance” 

SESSION 10: WAVELET AND TLM MODELING TECHNIQUES 

Chairs: Wolfgang J.R. Hoefer and Peter Russer 

1320 “The Implementation of a High Level (Ist-order) Haar Wavelet 
MRTD Scheme” 

1340 “Multi-Resolution Based TLM Technique Using Haar Wavelets” 


1400 “Formulation and Study of an Arbitrary Order Haar Wavelet Based 
Multi-Resolution Time Domain Technique" 


Kevin J. Cyber! & Daniel D. Reuster 

Ulrich Turk & Peter Russer 

(Parallel with Sessions 6 & 7) 

Colin E. Brench 

Reinaldo Perez 

Bruce Archambeault 


Bruce Archambeault & James L. Drewniak 

Wei Cui, Xiaoning Ye, Bruce Archambeault 
Doug White, Min Li & James L. Drewniak 

Y. Ji, T.H. Hubing, & H. Wang 

Gerald J. Burke & David J. Steich 


(Parallel with Sessions 10 & 11) 

Joseph W. Schuster & Raymond J. Luebbers 
Joseph W. Schuster & Raymond J. Luebbers 
James Pickeisimer & Raymond J. Luebbers 
G. Franceschetti, S. Marano, N. Pasquino, & I.M. Pinto 
C.W. Trueman, R. Paknys, J. Zhao, D. Davis, & B. Segal 

Kent Chamberlin, Mikhailo Seledtsov & Petar Horvatic 
Frank J. Ryan 

Donald de Forest Boyer & Huong Pham 

(Parallel with Sessions 9 & 11 

Enqiu Hu, Poman P.M. So, Masafumi Fujii, Wei Liu 
Wolfgang J. R. Hoefer 

Ismael Barba, Jose Represa, Masafumi Fujii, 

Wolfgang J.R. Hoefer 

Costas D. Sarris & Linda P.B. Katehi 
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WEDNESDAY AFTERNOON 22 MARCH 2000 


SESSION 10: WAVELET AND TLM MODELING TECHNIQUES (cont) 

1420 “Computational Optimization of MRTD Haar-Based Adaptive Manos M. Tentzeris 

Schemes Used for the Design of RF Packaging Structures" 

1440 "Time-Domain Simulation of Electromagnetic Wave Propagation J. Paul, C. Christopoulos & D.W.P. Thomas 

in a Magnetized Plasma” 

1500 BREAK 

1520 TLM Simulation of Patch Antenna on Magnetized Ferrite Substrate" M.l. Sobhy, M.W. R. Ng, R.J. Langley & J.C. Batchelor 

1540 “On the Practical Use of Layered Absorbers for the Simulation of Jurgen Rebel, Tobias Mangold, & Peter Russer 

Planar Microwave Circuits Using the SCN-TLM Method" 

1600 “A Numerical Study of MEMS Capacitive Switches Using TLM” Fabio Coccetti, Larissa Vietzorreck, Vitati Chtchekatourov, 

Peter Russer 

1620 Thin Wire Modeling with the TLMIE-Method" S. Lindenmeier, C. Christopoulos & P. Russer 

1640 “What Determines The Speed of Time-Discrete Algorithms” Tobias Mangold, Jurgen Rebel, Wolfgang J.R. Hoefer, 

Poman P.M. So, & Peter Russer 

SESSION 11: TIME DOMAIN METHODS AND APPLICATIONS (Parallel with Sessions 9 & 10) 

Chairs: Amelia Rubio Bretones and R. Gomez Martin 

1320 "Introducing a New Time-Domain Electromagnetic Field Solver Craig C. Ouyang, B.N. Jiang & Nina Liao 

LSFEM^D-SD” 

1340 “Characteristic-Based Time-Domain Method for Antenna Analysis" Dan Jiao, Jianming Jin & J.S. Shang 

1400 ‘Modeling of Thin-Wire Structures by Solving the EFIE in Time Domain” Friedrich Schunn & Hermann Singer 

1420 Time-Domain Analysis of Thin-Wire Loaded Antenna Using Integral M. Fernandez Pantoja, A. Rubio Bretones 

Equations* R. Gomez Martin 

1440 “Haar MRTD Wave Propagation Through Isotropic Plasmas” Ismael Barba, Jose Repress, Masafumi Fujii, 

Wolfgang J. R. Hoefer 
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Abstract 

In this paper, a 2D multiresolution time-domain (MRTD) scheme based on high level (lst-order) 
Haar wavelets is presented. The use of a lst-order wavelet scheme in MRTD leads to coupled time 
step updating equations. The boundary conditions can be very easily and directly implemented, and the 
reconstructed total field distribution is smoother than the field distribution obtained with a 0-order 
uncoupled Haar MRTD scheme. The most important advantage of this new scheme is its potential for 
considerable savings in memory and computational time. 

L INTRODUCTION 

Recently, the multiresolution time-domain (MRTD) approach based on wavelets has attracted the 
attention of many researchers in electromagnetic wave modeling. Battle-Lemarie[l], Haar[2], 
biorthogonal B-spline[3] and Daubechies[4,5] wavelets all have been tested as basic functions of 
MRTD, 

The most important advantage of this new technique is its capability to provide an effective way for 
time and space adaptive gridding through thresholding of the field wavelet expansion coefficients. If 
the value of a wavelet coefficient falls below a certain threshold, it will simply be discarded. This 
reduces the information stored in memory, yet the accuracy of the computational results is hardly 
affected. As a result, the mesh appears effectively coarser at these wavelet locations. Furthermore, the 
position and the number of wavelet expansion coefficients with significant value change in each time 
step, creating a space and time adaptive dense mesh in regions of strong field variations, while 
maintaining a much coarser mesh elsewhere. Thus the MRTD method based on wavelets in 
combination with a thresholding technique leads to an enhanced spatial resolution while reducing 
computational expenditure. 

In the case of Battle-Lemarie MRTD, when the field is sampled or injected, many terms of the field 
expansion coefficients must be considered by virtue of the nonlocal characteristics of the Battle- 
Lemarie wavelet functions. As compactly supported wavelets, Daubechies’ wavelets and biorthogonal 
B-spline wavelets require less terms, but they still lead to greater computational effort. Moreover, the 
implementation of boundary conditions is very complicated for Battle-Lemarie wavelets, Daubechies’ 
wavelets and biorthogonal B-spline wavelets MRTD schemes. In the case of 0-order Haar wavelet 
MRTD, it has been shown [2] that the scaling and wavelet updating equations are independent of each 
other. Coupling occurs only at source locations and at the boundaries. Hence, excitation and boundary 
conditions are usually implemented by interpolation. The resulting computational field distribution is 
not always smooth when the topology of the boundaries is complex. 


526 





In this paper, a 2D Multiresolution Time-Domain (MRTD) scheme based on high level (lst-order) 
Haar wavelets will be derived. By including lst-order wavelets we obtain coupled MRTD updating 
equations everywhere, and the reconstructed total field distribution is smoother than that obtained by a 
0-order uncoupled Haar MRTD scheme. The principal advantage of this new scheme is its potential to 
realize considerable savings in memory and computational time. 


n. Theortical formulation 

A. The Definition of the Basic Functions 

We employ a definition of Haar scaling and mother wavelet functions that differs slightly from the 
original definition. 


1 |,s| <0.5 

0(s) = 10.5 |i| = 0.5 

0 elsewhere 


Each electric and magnetic field component is expanded using pulse functions h n (t) in time domain 
and Haar scaling functions <p m (s) and r -order wavelet functions y/ r / ( 5 ) in space domain, where 
, 1 for (n - 0.5)Ar < t < (n - 0.5)A/ 

*■(*)“ 4 A I T. 

0 elsewhere ^a) 


0.5 

* = -0.5 

1 

- 0.5 < s < 0 

0 

0 

11 

-1 

0 < s < 0.5 

-0.5 

* = -0.5 

0 

elsewhere 




0 elsewhere 


-m), s = x,y,z 


<'(*) =2*V(2 

As 


r = 0, p = 0;r > 0, p = ±1,±2 ±2 r ~\s = x,y,z 



Fig. 1. ID Haar scaling and wavelet basic functions in a field node 

For lst-order Haar wavelet MRTD the spatial basic functions in one coordinate direction for a field 
node are shown in Fig. 1. The scaling and wavelet functions have the following important properties: 
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where (/(s)|g(s)) = £J/(s)g(s)dsdenotes the inner product of the functions /(s)and g(s). <Jis 
Dirac’s delta function. 


5 . 1st-Order 2D MRTD Scheme 

Consider Maxwell’s curl equations for the TE polarization case with the propagation vector in the 
x-z plane and the electric field pointing in y-direction. 

BE y {x,z,t) _ BHJx,z,t) 

dt (7a) 


3jc 


dE y (x,z,t ) _ dH x (x,z,t) 
Tz ~ M Bt 


(7b) 


^ =e B^ +oEy ( X , z , t)+Jj(x , z , t) 

Bz Bx Bt (7 C ) 

The basic functions in each 2D field node on the x-z plane are tensor products of the stun of scaling 
and wavelet functions multiplied by the time pulse function as follows: 

(8a) #(*){/* (z)ft„(f) ( 8b > 

fi(x)ytfl~\z)h n (t) (8c) ^(x)wl M (z)h„(t) (8d) 

VAxWMKit) (8e) V,{x)V k {z)h n {t) (8f) 
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The field components are expanded in terms of the above basic functions 

F s (x,z,t) = f>„ + „ 0 (0 x { £L'F'^ (X) + XX wl) 

*=-^0 l«=~» r=0 p ] 

r - r - n (9a) 

X s . + ^ 0 ^(z)+2S» + ^„V%(z)) 

L*=-~L r =0 /» JJ 

r = 0,p = 0;r>0 ) p = ±l,±2-±2 r - 1 (9b) 

F = E,H,J\ s = x,y,z\ % = <l>,yr, g = p,y (9c) 
for E y (x,z,t), F = £, s = y, =0, 4 =0, k 0 =0 (9d) 

for J y (x,z,t), F = J , s-y, ^ =0, 4 =0, fc 0 =0 (9e) 

,/br H x {x,z,t), F - H, s - x, =1/2, ^ = 0, fc 0 =1/2 (9f) 

for H.(x,zJ), F =H , s = z, Wq =1/2, z 0 =1/2, k 0 = 0 (9g) 

Subsequently, each field component is substituted in the Maxwell’s component equations [7a-c], 
and following Galerkin’s procedure and applying the Method of Moment, the resulting expressions are 
tested with the basic functions. By virtue of the inner product properties of the basic functions, we can 
obtain the time iterative difference equations for all expansion coefficients of all field components. 
The following are four examples of those equations, 
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C. Boundary Conditions 


Firstly consider perfect electric conductor (PEC) boundary conditions. According to the image 
principle, PEC walls can be modeled by imposing odd symmetry conditions for the tangential electric 
field. For the 0-order Haar wavelet MRTD scheme, the scaling and wavelet expansion coefficients are 
not coupled at the inner computational nodes, but only at the boundaries and at source nodes. 
Therefore, the PEC boundary conditions are usually implemented by combining scaling and wavelet 
expansion coefficients at the boundary nodes such that the total tangential electric field at those nodes 
is zero. In program implementation, the expansion coefficients are obtained by using the image 
principle and interpolation from the inner nodes so that the tangential electric field varies smoothly in 
front of boundary. 



Fig.2, The left-hand side PEC boundary in one-dimensional case 

For lst-order Haar wavelet MRTD schemes, as can be seen from the above time iterative difference 
equations of the expansion coefficients, the scaling and wavelet expansion coefficients are coupled. 
All expansion coefficients at the PEC boundary nodes can simply be set. For simplicity, the 
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implementation of PEC boundary conditions is described in the one-dimensional case, the results can 
be extended directly to the two-dimensional case. The actual position of the boundary is one half of the 
mesh size away from the center of the boundary cell. Fig. 2 shows the left-hand side boundary of the 
computational domain. So is the PEC boundary node. Sj is the first electric field node from So in the 
inner computational region. The precise boundary location is at Mo . The expansion coefficients of the 

node So can be enforced by using those of the node Sj: E$ = —Ef , Ef = Ef, Ef' 1 = Ef '*', 

Ef 1 = Ef 1 . Hence, it is straightforward to implement PEC boundary conditions in this scheme. 

Perfect magnetic conductor (PMC) boundary conditions can be imposed with equal ease. Perfectly 
Matched Layer (PML) absorbing boundary conditions can be implemented in a manner similar to the 
traditional FDTD method. The only difference is that each expansion coefficient must be handled 
independently in the PML. 


D. Stability Condition 

The stability condition for the high level wavelet 2D-MRTD scheme is 
At< 

-) 2 



(id 


where c is the wave propagation velocity. r x and r z are the maximum order of wavelets in x and z 
direction respectively. Due to the use of Ist-order wavelets, the spatial discretization intervals Ax and 
A z are four times greater than those of an equivalent FDTD grid. The lst-order Haar wavelet MRTD 
thus yields the same space resolution as FDTD with a time step that is two times greater than that of 
FDTD. Therefore, when modeling a process for a given time duration, this new scheme needs only 
about 50% of the number of time steps required by the conventional FDTD. 


E. Thresholding Technique 


Thresholding is the most promising technique for reducing memory requirements. It reduces the 
amount of stored information by omitting expansion coefficients that are smaller than a threshold 
value. When implementing the thresholding technique, the field coefficients must be evaluated and 
checked against the threshold value at every time step. This will increase computational overhead and 
may lead to a larger cumulative error. Therefore, the tradeoff between computational efficiency and 
accuracy should be taken into consideration. An optimum procedure must be found that reduces 
memory requirements and maintains computational efficiency at the same time. 

In our lst-order Haar wavelet MRTD scheme, thresholding technique is implemented by combining 
a relative and an absolute thresholding procedure. The relative thresholding compares the wavelet 
expansion coefficients to a given fraction of the scaling expansion coefficient at the same cell and at 
each tim e step. All wavelet expansion coefficients that fall below this threshold value are eliminated 
from the subsequent calculations. When the scaling expansion coefficient itself is very small, the 
absolute threshold procedure is employed; the absolute threshold value is usually set to a fraction of 
the ma ximum field value at the source. Both thresholding procedures do not add any significant 
overhead in execution time. In our program implementation, a dynamic allocating array is used to save 
all scaling and wavelet expansion coefficients for each field node. At the same time, a flag variable 
must be added to record what wavelet expansion coefficients are saved in the above array. The array 
and the flag variable must be modified at each time step. 
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EL NUMERICAL VALIDATION 

To validate this approach, we have performed two numerical examples. 

Firstly, four simple air-filled square resonator cavities for which analytical solutions are available, 
are solved and compared with the FDTD and the 0-order Haar wavelet MRTD methods. The 
dimensions of these cavities are shown in Table L To achieve the same order of computational 
accuracy, the same background mesh is used for all methods in each cavity. But the cell size of the 0- 
order Haar wavelet MRTD is twice that of FDTD while the cell size of the lst-order Haar wavelet 
MRTD is four times that of FDTD. The computational results are shown in Table L The errors of the 
0-order Haar wavelet MRTD is a little larger than those of FDTD, but the results of the lst-order Haar 
wavelet MRTD is almost the same as those of FDTD. At the same time, we have found that the 
waveform of the reconstructed field with 1-order Haar wavelet MRTD is much smoother. Thus the 
results indicate that 1-order Haar wavelet MRTD is much accurate than 0-order Haar wavelet MRTD 
even using the same accuracy meshes. 


Table I. Comparison of the lowest resonant frequencies computed with different methods 


Cavities 

sizes 

(mmxmm) 

Analytical 

value 

(GHz) 

FDTD 

0-order Haar wavelet MRTD 

lst-order Haar wavelet MRTD 

Mesh size 

WmSZBm 

Relative 

error 

Mesh size 


Relative 

error 

Mesh size 

Value 

(GHz) 

Relative 

error 

16X24 


16x24 

11.2585 

0.0115% 

8x12 

EE5Z9 

I'Mf 

4x 6 



16X32 

10.4746 

16x32 

10.4711 

0.0334% 

8x16 

10.4575 


4x 8 


H 

16x48 

9.8755 

16x48 

9.8701 

0.0547% 

8x24 

9.8515 


4x12 




9.6571 

16x64 

9.6510 

■•i&im 

8x32 


IWiLM 

4x16 

KE5B 

BBSS 


As the second example, a WR28 waveguide bandpass filter is analyzed to investigate the effect of 
thresholding on the accuracy and the memory saving. The waveguide is terminated at both ends by 
PML boundary conditions. The model is excited with a modulated Gaussian waveform source. Fig. 3 
shows the total field distribution obtained with lst-order Haar wavelet MRTD without thresholding at 
time 4000At. Fig. 4 shows the computed Sn and S 21 parameters with lst-order Haar wavelet MRTD. 

The memory saving is quantified by the wavelet expansion coefficient compression rate defined as 

_ Number of wavelet expansion coefficients omitted by thresholding ^$0% 

Total Number of scaling and wavelet expansion coefficients 

Table n gives the computed resonant frequencies and memory saving obtained by using 1st order 
Haar wavelet MRTD (at time step 8000) and 0-order Haar wavelet MRTD (at time step 8000) with 
different threshold values. For comparison, the results obtained with TLM are also listed. The same 
background mesh is used for all three methods. As can seen from table n, lst-order Haar wavelet 
MRTD can save much more memory, it is about 1.5 times that of 0-order Haar wavelet MRTD. The 
optimal relative threshold fraction is about 0.01%, and large memory savings (about 74%) are 
attainable by conserving a reasonable accuracy. 


Table n. Comparison of frequencies and memory savings obtained with different methods 


Relative 

Threshold 

fractions 

1 TLM 

i 0-order Haar wavelet MRTD 

| lst-order Haar wavelet MRTD j 

/1 (GHz) 

f 2 (GHz) 

fi (GHz) 

f 2 (GHz) 

a (%) 

fi (GHz) 

/z(GHz) 

a (%) 

0.001% 

31.2625 

31.9449 

31.5456 

32.2330 

41.35 

31.3412 

32.0236 

65.45 

0.005% 



31.5572 

32.2457 

45.61 

31.3471 

32.0291 

69.73 

0.010% 



31.5897 

32.2789 

49.58 

31.3587 

32.0413 

73.95 

0.050% 



31.9175 

32.6451 

53.85 

31.5379 

32.2552 

77.87 
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iv. Conclusions 

A 2D Ist-order Haar wavelet multiresolution time-domain (MRTD) has been implemented. There 
are 4 basic functions for representing EM fields on each node in each coordinate direction by including 
lst-order wavelets in MRTD. The new MRTD time step updating equations are coupled everywhere in 
the computational domain. PEC and PMC boundary conditions can be very easily and directly 
implemented without reducing computational accuracy. The reconstructed total field distribution is 
smoother than fields computed with a O-order uncoupled Haar MRTD scheme. The most important 
advantage of this new scheme is its potential for considerable savings in memory and computational 
time. The presented scheme can be directly extended to three space dimensions. 
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Fig. 3. The total field distribution obtained with lst-order MRTD without thresholding at time 8000At. 
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Fig. 4. Sji and S 2 i parameters obtained with lst-order MRTD without threshold at time 8000At. 
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Abstract -A first Haar-wavelet based multi-resolution formulation of the TLM method is proposed. The 
technique is derived for the simplest 1-D case. To validate it, we have computed the resonant frequencies of a 1- 
D resonator bounded by two electric walls. Results have been compared with resonant frequencies obtained by 
the traditional TLM method, and with the analytical values. 


I INTRODUCTION 

Multi-resolution time-domain (MRTD) techniques have recently been used to solve 
electromagnetic problems, using both Battle-Lemarie [1] and Haar [2] wavelets. They have usually 
been developed following the traditional FDTD update formalism. 

In this paper, we have formulated a Haar-based multi-resolution technique in terms of the TLM 
method. In this derivation we have taken advantage of the equivalence between TLM and FDTD 
and thus capitalized on previous work performed with FDTD. 

H FORMULATION 

II.a. 1-D MRTD Basic Functions and Algorithm. 

The Haar wavelet formulation has the advantage over other wavelet formulations that the resulting 
algorithm is similar to the traditional FDTD equations. However, in the multi-resolution method 
the equations are computed independently for each basis function. We can expand each field 
component in a combination of the Haar scaling (<(>) and wavelet (y) functions [3]: 

E y (f.t)=XX h .(t> bf ^(xHE?- V i(x)} (1) 

i=l n=l 


The notation is that used in [2], but reduced to one space dimension. We can expand the magnetic 
field (H z ) in orthonormal basis functions as well. If we substitute each component in Maxwell’s 
equations, expressions identical to the traditional FDTD update equations are obtained, and they are 
formulated independently for each basis function coefficient. 
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n.b. l-D FDTD-TLMEquivalence. 

It is well known that the TLM method [4] is based on the isomorphism between Maxwell’s 
equations and transmission line equations: the propagation of voltage and current impulses through 
a mesh of transmission lines is computed, yielding the equivalent electric and magnetic field 
components. The usual identities between TLM and field quantities are: 

feV, 1=H (2) 

For example, the discrete voltage pulses at the nodes of a TLM mesh are given by: 

Vfct) = ii.V,8n(t)5,(x) (3) 

i=1 n=l 

where Sn(t), 8i(x) are Dirac pulses at t=nAt, x=iAx, respectively. Theses voltage samples are 
equivalent to the discrete electric field samples: 

E(f, t ) = ti D E i h„(t)t. n (x) (4) 

i=l n«l 

that can be computed and updated with FDTD. However, since the FDTD scheme has an 
equivalent TLM formulation, both algorithms should give the same result. If we now apply a Haar 
wavelet decomposition to the field in Eq. (4), we obtain the usual MRTD model in terms of scaling 
and wavelet functions [3]. Every set of basis functions is calculated independently with the FDTD 
updating procedure. By virtue of the equivalence given in Eq. (2), two separate TLM processes can 
be employed, one for each term of the field in Eq. (1). For example, in figures 1 and 2 the 
decomposition of a sinusoidal electric field (the first resonant mode between two parallel electric 
walls) is shown: 




b) 


Figure I. Scaling functions for a) Electrical Field b) Equivalent TLM mesh voltage. 




b) 


Figure 2. Wavelet functions: a) Electric Field wavelet functions, b) 
Equivalent pulses in the second TLM mesh. 


535 




II. b. Boundary Conditions. 

As in FDTD-based MRTD, boundary conditions should be applied to the whole field/voltage, 
involving thus both TLM meshes. This can be handled in two ways: 

a) An equivalent circuit, connecting both TLM meshes, can be calculated. 

b) A condition for the total field can be applied at each boundary node to calculate the 
reflected pulses in the transmission lines. 

To preserve the analogy with FDTD-based MRTD, we have employed the second option. For 
example, for a PEC wall placed in i-1, the voltage impulses incident on the node at i=l in both 
TLM meshes are shown in figure 3. 

,y{a) nVj(l) 

-1- 

nVf(l) n Vj(l) 

-1- 

rVid) nTO 


Figure 3. Incident pulses in both (scaling and wavelet 

equivalent) transmission lines, for i=l. 


In this figure, nV^l) is the pulse incident from the left in the TLM mesh for the scaling function, 
for i=l, nVV^l) is the pulse incident from the right in the same node. n Vi(l) is the total voltage in 
the left side of the cell ( n V l (l)= n VVl)- n V ¥ (l) equivalent to while n Vr(l) is the total 

field in the right side of the cell. Computing both fields, we obtain: 

n v 1 (i)= n vVi)+ n v v (i)= n v;(i)+„v;(i)+ n v 1 v (i)+ n v r v (i) 

. v r (1)=„V* (1)-„V * (l)= n V* (D+.v* (l)-nVr (l)-„ v r v (1) (5) 

Coefficients n V*(l) and n V r ¥ (l) can be calculated by means of the transmission coefficients 
between neighboring nodes. n V|*(l) and n V, ¥ (l) can be calculated employing Eq. (5) and the fact 
that, if we place the PEC wall in x=Ax/4, n V|(l)=0. To obtain a second condition, we can require 
that there be no dispersion in the reflected signal. Finally, the equations for the PEC wall would be: 


y+Q) - n -,V;(l)+n- l V t V (l)-nV;(l)-nV7a) 
n yy (1) „-,V^(lK-lV f ,y (l)-nV r ' > (l)- n V f V '(l) 

A similar reasoning would give us the boundary conditions for i=I. 
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m RESULTS 


Resonant frequencies of a ID resonator bounded by two electrical walls have been calculated. The 
separation between both walls is 0.3 m. That means the theoretical frequencies should be 0.5 - 1 - 
1.5 -2. ..GHz. 

A first simulation has been performed by dividing the space between the walls into 30 cells, i.e., 
taking into account the different positions of the PEC walls in both schemes. The cell size of the 
TLM mesh is 0.01 m, and that of the MR-TLM is 0.0102m, so that the time steps are At=33.3 and 
33.9 ps, respectively. We have performed 2000 time steps, employing both the traditional TLM 
method and our Multi-resolutive TLM formulation. Results are compared in table 1 and figure 4. 


Table 1: Resonant frequencies in GHz of a ID resonator bounded by 
two electric walls, discretized into 30 cells. _ 


TLM 

.50 

■Mill 

1.50 



1 3.00 j 

MR-TLM 

.50 


1.50 

12.00 [ 





Figure 4. Resonant frequencies in GHz of a ID resonator bounded by 

two electric walls, discretized into 30 cells. 


If we reduce the number of cells between the two walls, a larger difference between both methods 
should be found in the subsequent figure. For example, if the number of nodes is 15 (i.e. 
Ax=0.02/0.0207m in every case, and At=66.7/69.0 ps). Results are shown in table 2, and figure 5. 


Table 2: Resonant frequencies in GHz of a ID resonator bounded by 
two electric walls, discretized into 15 cells.___ 


TLM 

KM 

IWiTl 

IKM 

2.00 

2.50 



MR-TLM 

KM 

IE3 

IBM 

2.00 

2.50 

3.00 

3.50 
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0 1 2 3 4 

Figure 5: Resonant frequencies in GHz of a ID resonator bounded by two 

electric walls, discretized into 15 cells. 

A stronger reduction of the number of cells (10 nodes, i.e. Ax=0.03/0.0316m, At=100/105.3 ps), 
would lead to the results shown in table 3 and figure 6. 

Table 3: Resonant frequencies in GHz of a ID resonator bounded 



O 1 ~ l 2 ^ ' r r 3 ' r 4 


Figure 6: Resonant frequencies in GHz of a ID resonator bounded by two 

electric walls, discretized into 10 cells. 
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IV CONCLUSION. 


Haar wavelets have been applied to obtain a multi-resolution formulation of the TLM method. 
Resonant frequencies of a one-dimensional resonator formed by two electric walls have been 
computed to validate the ID algorithm and boundary PEC boundary condition. Since a ID TLM 
simulation is dispersion free, any eixor in the obtained frequencies could only be caused by errors 
in the sampling of the results and the DFT employed, provided that the boundary conditions and the 
algorithms themselves (TLM and MR-TLM) are properly formulated and implemented. 
Differences in the shape of spectral lobes are due to the different electrical position of the output 
probe (in the middle of a TLM cell, and at !4 or 3 A of a MR-TLM cell, respectively). 
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Abstract 

The use of wavelet bases for the discretization of Maxwell’s equations via the Moment Method 
has led to the development of novel time domain schemes with inherently adaptive properties. 
Due to the latter, such schemes have attained significant economy in memory requirements when 
applied to electromagnetic and circuit problems. For robust, adaptive schemes, arbitrary order 
wavelet formulations have to be employed, in order to provide the capability of modeling arbi¬ 
trarily fine geometric details of a given structure. In addition, one of the remaining challenges for 
multiresolution time domain techniques is to efficiently incorporate non-uniform time discretiza¬ 
tion in order to maximize their performance in terms of CPU time requirements. Both questions 
are addressed in this paper for the case of Haar wavelet basis. 


1. INTRODUCTION 

Time domain differential methods are becoming increasingly popular among the electromagnetic 
co mmuni ty as a consequence of their versatility and their ability to provide simulation results that 
are intuitively meaningful to circuit designers and microwave engineers. However, the poor scalability 
of the computational demands that methods such as the Finite Difference Time Domain (FDTD [1]) 
or the Tr ansmis sion Line Matrix (TLM [2]) method present, has posed the significant question of 
deriving new techniques that can perform reasonably accurate computations, with computational 
requirements that can be met by nowadays computer performance. 

In general, non-uniform meshing, based on adaptivity criteria can significantly improve the effi¬ 
ciency of a numerical scheme. Still, a remaining problem comes from the typically over-restrictive 
limitation that the minimum cell size of the grid imposes on the time step [3]. For this problem to 
be resolved, computationally burdensome techniques such as extrapolation or interpolation of field 
quantities at different mesh interfaces have been proposed in the past [4, 5]. Apart from the apparent 
complexity and the related computation time overhead that is related to the latter methods, the 
possibility of adaptive mesh refinements of various orders at several positions of the computational 
domain appears to be quite complicated within their context. 

Wavelet based time domain techniques, such as the Multiresolution Time Domain Method (MRTD 
[6]), offer a promising alternative that overcomes the previously mentioned problems. Indeed, an 
adaptively non-uniform, moving mesh is effectively implemented by merely thresholding the time 
dependent wavelet coefficients of a field (or circuit) quantity series expansion at each time step. Such 
an approach leads to both memory and computation time economy for the reason that thresholded 
coefficients do not participate in operations that are carried out in subsequent time steps. 

However, most research studies in this area have concentrated on either low order wavelet schemes 
( employing one or two [7, 8] wavelet levels ), or on higher order wavelet schemes applied to structures 
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Figure Is Haar scaling function and 0-wavelet. Figure 2: Haar wavelet functions of order 1. 


that contain inhomogeneities of the order of the scaling function [9]. Hence, a remaining challenge for 
multiresolution techniques is to implement arbitrary order schemes that can model geometric details 
at the highest wavelet resolution, which may be much smaller than the scaling cell size. Applications 
of such schemes were recently presented [10] and a more detailed development for them is herein 
given. An operation count is approximated and inherent memory - computation time - accuracy 
trade-offs are discussed. Source and hard boundary modeling is also presented and validated. Finally, 
the problem of introducing time adaptive stepping is approached and some preliminary considerations 
are given. 

Throughout this paper, the Haar basis is employed for all schemes that are derived. This is due 
to the relative simplicity of Haar Multiresolution Time Domain (MRTD) schemes, that constitute a 
generalization of Yee’s FDTD. However, from a mathematical point of view, the same methodologies 
are to be followed for similar derivations in any wavelet basis. 

2. FORMULATION OF ARBITRARY ORDER HAAR WAVELET MRTD 


A. Definitions 

For the extension of Haar MRTD to arbitrary orders, the following definitions are employed (the 
difference from similar definitions adopted in [8] and similar works should be noted ). 

The Haar scaling function is defined as a pulse (Figure 1) : 


m = 


{ 


i, 

0 , 


0<£<1 

otherwise 


(i) 


and the scaling function basis is produced by orthogonal translations of the latter: 




The Haar mother wavelet function is defined as (Figure 1): 


r i, o < i < 1/2 

V>(£) = | - l, 1/2 < e < l 

( 0, otherwise 
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( 2 ) 

(3) 


































and the basis is now produced by both translations and dilations, as shown in the definition: 

(4) 

The parameter r denotes the wavelet resolution, while p = 0, l,--*,2 r — 1 denotes the position of 
the wavelet within a cell defined by a scaling function First order Haar wavelet functions are 
depicted in Figure 2, for the role of p to become evident. In all previous definitions, the variable £ is 
a normalized variable, representing for example in the case of a ^-directed scaling/wavelet function, 
the ratio z/Az, A z being the cell size in z. direction. 

B. Generalized Finite Di fference Equations 

An arbitrary order Haar MRTD scheme is now formulated for the transmission line equations ( 
Telegrapher’s equations) : 

dv(z,t) T di(z, t) /M 

- ST (5) 

di(z,t) _ _ c au(z,t) 

dz dt ' 

Voltage and current are expanded in a series of scaling and wavelet functions : 

v(z, t) = £ kV* K(z) hk(t) +Y,E kV$'»i> T mj) (z) h k (t) (7) 

m,k m,k rj> 

i(Z)t) = fc+0.5-^+0.5 rfhn+O.siz) hk+0.s{t) + ^ fc+0.5^m+0.5^m+0.5^(^) h k+ 0.5 W (8) 

m,k m,k rj> 

For the formulation of the scheme via the Method of Moments [6], integrals of the form : 

£>*«« (9) 

with : i 

m = M A) or ^ >p (A) 

»(A) = K'± i/s(A) or <, ± 1 / 2 , p ,(A) (10) 

and \ — x,y,z are evaluated in a closed form. Thus, finite difference equations for arbitrary orders 
of wavelet expansions can be derived, and write as follows : 

i+ 1 v* = C(r,p )((“) 

r,p 

k+ 1 V^> = k V^» - C(r,p) ( k+i I* m+i - 

-£tz £ Cf *+jCi) <«> 


C(r,p) = 2i 1 (i p , 2 .-. - ip, 2 r-i-i) 

Vi(r,P,r',p') = 

2^{^(2 r (0.5 - i + £r) -P) -2V(2^(0-5 - i + ^ -p) + V>(2 r (0.5 - i + -p)} 

i = 0 , 1 . 

and it ,a being Kronecker’s delta function. Dual equations hold for current coefficients. 
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5. WAVELET EXCITATION AND HARD BOUNDARY MODELING 


It has been already mentioned [8, 11] that Haar MRTD schemes with only zeroth order wavelets 
consist of uncoupled equations. In such schemes, wavelet coefficients become coupled to the scaling 
coefficients through the excitation and the boundary conditions [11]. A similar problem is encoun¬ 
tered in the general case of an arbitrary order scheme, since by the multiresolution principle, a 
multiresolution decomposition of the approximation space 14 [12] as : 


v k = Vo e Wo e Wi e • • • © w k „i (14) 

is equivalent to : 

V k = V k -i(bW k - 1 (15) 

The latter expression corresponds to a one wavelet level multiresolution analysis, in which scaling 
functions are combined with the (uncoupled) highest order wavelets of the former decomposition. 
Hence, in our experiments, a scaling excitation is sufficient to excite sill wavelet terms except the 
highest order ones. For these terms , an explicit excitation scheme has been derived and validated, 
based on the application of continuity condition at planes within the excitation cell, generalizing the 
procedure described in [11]. These planes are defined by the scaling functions corresponding to the 
approximation space V*_i (Figure 4). 

For the modeling of hard boundaries, it is interesting to note that image theory can be applied in 
a straightforward way due to the form of Haar wavelets (Figure 3). Hence, perfect electric / magnetic 
conductors (PECs/PMCs) or short/open circuits in circuit geometries respectively, can be modeled 
by appropriate matching of wavelet functions. Such a modeling, applied to a TM Z incidence on a 
PEC is validated by the results shown in Figures 5-8 for a second order MRTD scheme. Total field 
pattern and second order wavelet coefficients for the tangential to the boundary magnetic field H y 
are shown to assume the expected form in this case. 





Image Theory 







P.E.C. 

or 

P.M.C. 


Image Theory 


Figure 3: Image principle applied to hard boundaries ; (-) image and (+) image, derived by multiplying the 
wavelet function by (+1), (-1) respectively. 
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Figure 4: Excitation cell refined by four second order wavelets in a second order MRTD scheme. Dashed 
lines indicate the position of planes where the continuity condition is applied in the sense of [11]. 


4. EXECUTION TIME AND MEMORY ECONOMY IN NON - ADAPTIVE AND SPACE ADAPTIVE MRTD 
A. Non — adaptive MRTD 

The execution time budget of Haar MRTD is compared to the conventional FDTD without thresh¬ 
olding being taken into account. First, a stability analysis is performed, based on von Neumann’s well 
known method [13] and verified by numerical experiments. It was found that the stability condition 
for a general three - dimensional scheme is : 


A i< —• 


1 

WF+Wf+W 


(where u p is the maximum phase velocity in the computational domain). In case R x = Ry = R z = R 
and Ax = Ay = Az — A, a resolution of achieved by the MRTD technique, with a maximum 

time step : 

(17) 

On the other hand, FDTD can achieve the same resolution with a time step [1] 

A< — D = ^2^r = 0 - 5Ai —° (18) 

The same relationship between At^^ D and Aholds for the one dimensional scheme presented 
in 2.B . Then, taking into account all possible algebraic manipulations (pairing of same order terms 
to avoid excessive multiplications), the number of multiplications per scaling cell, per MRTD time 
step for MRTD schemes and FDTD is compared. The one dimensional scheme treated in section 
2 is considered. Our measure of comparison takes into account both resolution refinement and the 
difference in time steps. Table 2 shows the results for MRTD orders of zero, one, two. 









Magnetic field reflection from a PEC - MRTD Oder- 2 


Magnetic field reflection from a PEC - MRTD order = 2 


Figure 5: Total incident and reflected magnetic Figure 6: Zeroth order wavelet coefficients of 
field from a PEC. incident and reflected magnetic field from a PEC. 


Magnetic field reflection from a PEC - MRTD order - 2 


Magnetic field reflection from a PEC - MRTD order - 2 


- 2-wavelets 


Figure 7: First order wavelet coefficients of in- Figure 8: Second order wavelet coefficients of 
ddent and reflected magnetic field from a PEC. incident and reflected magnetic field from a PEC. 


T&ble 1: Multiplications per scaling cell, per MRTD time step. 
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It is evident, that with the exception of zero order Haar MRTD, higher order schemes involve more 
operations. Therefore, higher order schemes (at least for the Haar basis which is studied here) are 
meaningful and efficient only when they are combined with thresholding techniques. 

B. Space Adaptive MRTD 

Space adaptivity is introduced by means of coefficient thresholding as indicated in [14], at the 
expense of accuracy of the solution. Usually, wavelet based methods can tackle this trade - off 
relationship efficiently and, up to a point, the higher the order of the scheme that one uses, the 
more efficiently this is achieved [10]. For an advection equation, such as the system of transmission 
line equations, for R being the maximum wavelet resolution and N being the grid points that a 
conventional FDTD scheme uses, the operations per time step are for a space adaptive scheme [15] : 



where n T are the unthreshoded coefficients at resolution r. The latter expression shows the potential 
for attaining significant time savings with wavelet based solvers, when n r is small as is the case in 
most applications [10, 14]. 

However, these time savings are attained at the expense of memory, since the use of data structures 
that “monitor” the sparsity of the vector of unknown coefficients step by step is necessary. 

It should be also noted that grid coarsening is restricted by the Nyquist limit. 

5. TIME ADAPTIVITY IN MRTD SCHEMES 


A. Time Wavelets 

Using wavelets in time, in a similar manner to which wavelet based spatial discretization is per¬ 
formed via Galerkin method, leads to a time adaptive gridding. Time adaptivity is again brought 
about by thresholding of time wavelet coefficients [16]. However, for the Haar basis, our numerical 
experiments have shown that the validity of this method is only limited to zero order wavelets [10]. 
Beyond that, it leads to unstable schemes. In general, the Wavelet - Galerkin technique does not 
yield stable time - stepping schemes, as can be seen from the roots of the finite difference equation 
for the scaling coefficients, when , for example, a Daubechies basis is used [17]. 

B. Adaptive Time Stepping 

Time adaptive stepping is an important method to optimize the execution time requirements of 
a general wavelet based solver. The technique is presented in [15] and suggests that the update 
of terms which correspond to interactions between scaling - scaling, scaling - wavelet and wavelet 
- wavelet functions can be carried out at different time steps. These time steps are determined 
by the approximation space within each such interaction “lives”. Terms that belong to successive 
approximation spaces ( V* and Vk+i ) are updated at time steps that differ by a factor of two (higher 
resolution terms are updated twice as often as the next lower resolution terms). The advantage of this 
approach is that the highest wavelet resolution within the computational domain does not impose a 
restriction on the time step for the update of lower resolution terms. 

Evidently, this algorithm cannot be implemented when E/H (or V/I) values are offset by half time 
step. If for example the magnetic field H is sampled at (k + 0.5) At then its value at (k 4* 0.5) x 2At, 
which is necessary for the update of terms that live in the second highest order approximation space, 
is not available and can only be interpolated. Therefore, all field values should be sampled at integer 
time steps. 

Also, the implementation of the algorithm requires storage of unknown coefficients at “previous” 
time steps, the length of the storage window being related to the highest wavelet resolution ( in 
particular, if R is the highest wavelet resolution introduced in the grid, 2 R previous time steps 
should be kept in memory). 
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6. CONCLUSION 


The formulation and study of an arbitrary order Haar MRTD technique has been presented. It 
should be emphasized that the introduction of higher order wavelets only makes sense when it is 
combined with thresholding techniques that render the method space adaptive. It has also been 
mentioned that thresholding of wavelet coefficients is not only connected to memory but to time 
savings as well. Time adaptivity algorithms which, when introduced into spatially adaptive wavelet 
methods, have shown the promise of providing a highly efficient alternative to subgridding techniques, 
are currently under study. 
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Abstract 

The MRTD adaptive gridding scheme that is based on the Haar expansion basis is discussed 
for arbitrary wavelet resolutions. Guidelines for the optimization of memory and execution time 
requirements axe presented. A multi-time-stepping procedure enhances further the computational 
economies offered by a combination of absolute and relative thresholding of the wavelet values. 
Keywords: Multiresolution, Time-Domain Techniques, Adaptive Gridding, Wavelets, 
Memory Compression, Thresholding, Haar 


I Introduction - Discussion on the Haar expansion basis 

Significant attention is being devoted now-a-days to the analysis and design of various types of RF 
Packaging structures (e.g. Flip-Chip, Multi-Layered Structures) used in Wireless and Computing 
applications. Though The finite-difference-time-domain (FDTD) scheme is one of the most powerful 
and versatile techniques used for numerical simulations, it suffers from serious limitations due to 
the substantial computer resources required to model such electromagnetic problems with medium 
or large computational volumes. Recently, MRTD (MultiResolution Time Domain Method) [1]—[9] 
has shown unparailed properties in comparison to Yee’s FDTD. In a MRTD scheme the fields are 
represented by a two-fold expansion in scaling and wavelet functions with respect to time/space. 
Scaling functions guarantee a correct modelling of smoothly-varying fields. In regions characterized 
by strong field variations or field singularities, higher resolution is enhanced by incorporating multiple 
resolutions of wavelets in the field expansions. The major advantage of the use of Multiresolution 
analysis to time domain is the capability to develop time and space adaptive grids. This is due to the 
property of the wavelet expansion functions to interact weakly and allow for a spatial sparsity that 
may vary with time through a thresholding process. Haar expansion basis functions (Fig.(l)) provide 
a convenient tool for the transition from FDTD to MRTD due to their compact support, and to their 
similarity with the FDTD pulse basis [6]-[9]. Nevertheless, the enhancement of additional wavelet 
terms the number of which is different from cell to cell and for different time-steps requires a careful 
consideration of the memory and execution time overheads. In addition, the fact that the stability 
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limit of the time-step decreases as more resolutions are added requires the use of an effective multi- 
time-stepping algorithm, that will maintain the required accuracy without increasing significantly 
the execution time requirements. 


II MRTD Scheme with Multiple Wavelet Resolutions 

For simplicity, the ID MRTD scheme for TEM propagation will be discussed. It can be extended to 
2D and 3D in a straightforward way. The Electric (E x ) and the Magnetic (H y ) fields are displaced by 
half step in both time- and space-domains (Yee cell formulation) and are expanded in a summation 
of scaling (0) and wavelet (i/v) functions in space and scaling components in time. For example, E x 

» rmai 2 r , 

E x ( Z ,t) = £ u, a) 

m,:~—oo r=0 i r =l 


where <f>i{z) = <p(z/Az - i) and $' tr {z) = 2 T ^xfjo{2 r (z/Az - i r ) - i) represent the Haar scaling and 
r-resolution wavelet functions located inside the t—cell. The conventional notation m E x j is used for 
the voltage component at time t = mAt and z = tAz, where At and Az are the time-step and the 
spatial cell size respectively. The notation for H y is similar. 


Substituting E x ,H y in the TEM equations and applying Galerkin technique derives the following 
equations for H y 

m+0.5#y jt -_o.5 = m-0.5^y s i_o.5 “ ~ *** 

Tmaz 2 T ™ Z 

+ E + Cl, r ,1,1 + E r,i r i,0 ( 2 ) 






+ E ( c 2,r^,-l < r 4i + E C 2,r,*ri,0 E X ? T )i ( 3 ) 

r=l »rl=l 


rr^’V (J rp<P . J p0O 

n+0.5^,i-o. 5 = m—0.5i* y ,i-o.5 “ •&*,*-! + <V,V,2,-1 

Tmo* . *2 

+ E (^V. ie ^>V»r, 2r + E e r / } V 1 r,* r ,0^ x> t ,,r ) 

r=l *r=ii 

+ <5 ijJj2 r'-i (dr'.v.l.O ^ AO 

^+l,2 T ‘ f } _ rr^t V ^ / J , J,. P‘00 




^moi *2 

+ E ,2V er ' »V » r > 1 ’ 1 ^x,i+l + E ^ »V ,0 ^x,r ) 

r=l r *r=»l 


( 5 ) 




where: i\ = max[l,INT((i r - 1) 2“( r '~ r )) +1] and %i ~ min[2 r ,INT(l + ir 2“( r '~ r ))]. Also, the c,d 
and e coefficients can be calculated by the Galerkin technique correlating the respective wavelet and 
scaling functions and 8 is the Kroenecker Delta. For example, 

c l,r,iriP = / = 2r/2 ' 1 (5i r ,2"(<5p,-l - Sp, 0) + - M) (6) 

Through the split of the wavelet coefficients of resolution r' > 0 to two groups ([l,..,2 r, “ 1 ] and 
[2 r _1 4- l,..,2 r ]) the calculations are performed more efficiently in terms of the scaling and wavelet 
of 0-Resolution contributions. In addition, it is clear that the calculation of these wavelet coefficients 
require a significantly smaller number of terms than 2 rmox+1 , something that proves that the num¬ 
ber of operations increase significantly slower than 0(2 2 ^ rrTulI+1 ^) as the maximum resolution r^x 
increases. 

II. 1 Hard Boundaries 

Due to the finite-domain nature of the expansion basis, the Hard Boundary conditions (Perfect 
Electric/Magnetic Conductor) can be easily modeled. For example, if a P.E.C. exists at the 2 = iAz, 
then the scaling E x coefficient for the i — cell has to be set to zero for each time-step m since 
the position of the conductor coincides with the midpoint of the domain of the scaling function. 
Nevertheless, the O-resolution wavelet for the same cell has the value of zero at its midpoint; thus 
its amplitude does not have to be set to zero. To enforce the physical condition that the electric 
field values on either side of the conductor are indpendent from the files on the other side, TWO 
O-resolution wavelet E x coefficients have to be defined. The one (on the one side of P.E.C.) will 
depend on H y values on this side only and the other (on the other side of P.E.C.) will depend on 
H y values on that side only. Wavelet coefficients of higher-resolution with domains tangential to the 
position of P.E.C. have to be zeroed out as well. As far as it concerns the equations that update the 
coefficients of the magnetic field H y , only E x coefficients on the same side of the P.E.C. have to be 
used. The rest of the summation terms have to be replaced with coefficients derived applying the odd 
image theory for the electric field. A similar approach can be applied for the modeling of a Perfect 
Magnetic Conductor (P.M.C.). 

It can be easily observed that for Wavelet Resolutions up to r^x, 2 rmci+1 coefficients have to be 
calculated per cell per field component instead of one component in the conventional F.D.T.D. The 
derived gain is that the new algorithm has an improved resolution by a factor of 2 rmai+1 that can 
vary from cell-to-cell depending on the field variations and discontinuities. In addition, MRTD can 
offer a significantly better E x field resolution close to P.E.C.’s through the 0-Resolution Wavelet 
Double term with a negligible computational overhead per P.E.C. (the second 0-Resolution term). 
Conventional F.D.T.D. assumes a constant zero E x distribution half-cell on either side of the P.E.C. 
by zeroing out its amplitude at the P.E.C. cell. 
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n.2 Field Reconstruction 

The alternating nature of the wavelet functions guarantees the improved time-domain resolution of 
the MRTD scheme. Assuming that the H y scaling and wavelet (0 to resolutions) coefficients 
for a specific cell i have been calculated for t = mAt, two values can be defined for the domain 
[(i — 1. 4- (*Y — l)2“ rmd *)A z, (i + i r 2~ Tmax )Az] of each wavelet coefficient ^ maX}tr of the maximum 
resolution As a result, a total number of 2 rma£+1 subpoints/cell at the positions: z — (i — 1 . + 
(if — 0.5) 2~( rma * +1 ))Az, for ir = 1,.., 2 rma * +1 can be used for the total field value reconstruction: 


+Ci m - 0 .5 M 


where: i{. = INT(2 r z) + 1, and 


Cl 


+1 if i T < 2 Tmax 
-1 if i T > 2 rmax 


(7) 


A similar expression can be derived for E x component. It is obvious, that only 2 + Tmax coefficients 
are required for the field reconstruction at each subpoint. 


H.3 Dynamic Adaptivity - Thresholding 

The fact that the wavelet coefficients take significant values only for a small number of cells that 
are close to abrupt discontinuities or contain fast field variations allows for the development of a 
dynamically adaptive gridding algorithm. One thresholding technique based on absolute and relative 
thresholds offers very significant economy in memory while maintaining the increased resolution in 
space where needed. For each time-step, the values of the scaling coefficients are first calculated for 
the whole grid. Then, wavelet coefficients with resolutions of increasing order are updated. As soon 
as all wavelet components of a specific resolution of a cell have values below the Absolute Threshold 
(that has to do with the numerical accuracy of the algorithm) or below a specific fraction (Relative 
Threshold) of the respective scaling coefficient, no higher wavelet resolutions are updated and the 
simulat ion moves to the update of the wavelet coefficients of the next cell. The same thresholding 
procedure is performed for both E x and H y components, of the respective medium(s). In this way, the 
execution time requirements are optimized, since for areas away from the excitation or discontinuities, 
only the scaling coefficients need to be updated. This is a fundamental difference with the conventional 
F.D.TJD. algorithms that cannot provide a dynamical time- and space- adaptivity even with grids of 
variable cell sizes (static adaptivity). 

11A Multi-Time-Stepping Implementation 

As it has been reported in [9], the maximum time step value that guarantees numerical stability for 
maximum wavelet resolution is 


At<rm “ > = 55^1 


( 8 ) 




where c is the velocity of the light in the simulated medium and At, A z are the time-step and the 
cell size respectively. The dynamically changing gridding that was described in the previous section 
allocates different values of maximum wavelet resolution throughout the grid for every time-step, thus 
deriving different values of stable time-steps from cell to cell. The easiest way to guarantee stability 
would be to identify the maximum used wavelet resolution and use the respective time-step. That 
would lead to a huge computational overhead of no practical gain especially for cells that a few or 
even no wavelet resolution is needed. On the other side, if different time-steps were to be used, the 
calculation of coefficients updated more often than the neighboring cells (high wavelet resolutions 
and smaller time-steps) would require the efficient interpolation of the values that are updated less 
often (larger time-steps). To simplify this procedure, the used time-steps in the grid are defined in 
powers of 2 starting from the smallest time-step (cells that use the highest wavelet resolution rmax), 
Af( rmai ). For example, the used time-step for a cell that requires the wavelet calculation up to the 
r use < rmax resolution would get the value A= A 2 rmax ~~ r ' iae . 

After identifying the appropriate time-steps for each cell, a second-order backward interpolation is 
used to calculate field values for intermediate time instants. For example, if the calculation of E x 
coefficients in one cell is performed with time-step 8t = Aand the calculation of H y in the neigh¬ 
boring cell is performed with a larger At = A= 8t 2 ri-r2 , there is the need for the calculation of 
Hy in 2 ri “ r2 subpoints between mAt and (m-f l)At for each time-step m. The interpolation process 
can be expressed as: 

i* t Hy = [(0.5 + (i M - 0.5)£)(1 + 0.5((4it - 0.5) A _ 0.5))] m+0 . 5 H y 

- [((<*,* - 0.5)^ - 0.5)(1 + (i int - 0.5)^ + 0.5)] m -o.sH y 

+ 0.5[(i int -0.5)^-0.5][(i int -0.5)^ + 0.5] m _ 1 . 5 H s , (9) 

for i int = 1, ^(= 2 ri " ra ) and can be applied to scaling and wavelet components. The use of the 
second-order scheme provides stability for thousands of time-steps. Using linear time interpolation 
was found to lead to instabilities and increased reflection error at the interface of the different time- 
steps. Though the interpolation process adds a computational overhead by requiring the storage of 
the coefficients for three time-steps, it improves significantly the requirements in execution time by 
performing the simulations at the maximum allowable time-step everywhere in the grid. 

II.5 Validation Case 

To validate the above approach, the MRTD algorithm was applied to the simulation of the TEM 
propagation of a Gaussian excitation at z — 200A z up to 3 GHz inside air dielectric (e r = fi T = 1) 
for 1,000 time-steps with size At = 0.95At rmo1 for each cell’s maximum used wavelet resolution 
rmax and 400 cells with A z = 2.5cm. Wavelets up to 2-Resolution are enhanced wherever needed. 
A relative threshold of 10“ 4 and an absolute threshold of 10 -6 offer an approximation error smaller 
than 0.4%. Fig-(2) which displays the reconstng$e£ E x field distribution at t — 500 Az demonstrates 






the fact that only 24 cells need to calculate the Wavelet Coefficients. Everywhere else E x is close 
to zero and shows no variation; thus it requires the calculation of only scaling coefficients. (Memory 
Compression=94%). The fact that an interpolation scheme is used for the time-stepping allows for a 
time-step aspect ratio of 2 3 = 8 : 1 in these two areas. In this way, the memory compression gain is 
transfered to the execution time as well without affecting the simulation accuracy. Fig. (3) which is a 
magnification of Fig. (2) for the area of increased resolution, proves the ability of Haar-based MRTD 
schemes with arbitrary wavelet resolutions to provide locally magnified accuracy through the accurate 
representation of field variations at multiple intracell subpoints. This optimized algorithm has been 
expanded in 2D and 3D and simulation results of practical RF Packaging structures (Flip-Chip, 
Emebedded Passives) will be presented at the conference, intracell subpoints. 

III Conclusions 

Various computational aspects concerning the Haar-based MRTD adaptive gridding scheme have been 
discussed and guidelines for the optimization of memory and execution time requirements have been 
presented. The use of absolute and relative thresholding of the wavelet coefficients has been combined 
with a multi-time-stepping procedure and has led to further economies. In addition, an optimized field 
reconstruction procedure allows for the quick and reliable demonstration of the increased resolution 
that is offered by the enhancement of multiple resolution of wavelets with m i n i m um computational 
overhead. In this way, the Haar-based MRTD adaptive scheme has been proven to be a very efficient 
and reliable full-wave simulation tool for complex RF structures, such as RF Packaging geometries. 
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Figure 1: O-Order Haar Function Basis. 
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Figure 2: Demonstration of the Memory Compression of E x Spatial Distribution at 
t = 500Af. 
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Figure 3: Demonstration of the Variable Resolution (Multipoint Intracell Representa¬ 
tion) of E x Spatial Distribution at t = 500At. 
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Abstract: In this paper, a one-dimensional 
time-domain transmission-line model of elec¬ 
tromagnetic wave propagation in a magnetized 
plasma is developed. The discretization proce¬ 
dure of the anisotropic Lorentzian conductivity 
function is based on bilinear transforms. The 
method is validated using a simple example with 
an analytic solution and applied to two exam¬ 
ples of ionospheric propagation. 

INTRODUCTION 

The description of electromagnetic wave propa¬ 
gation in a magnetized plasma was first stud¬ 
ied using the Finite-Difference Time-Domain 
(FDTD) method [1]. In a previous paper [2], 
Transmission-Line Modelling (TLM) was ap¬ 
plied to the simulation of propagation in a mag¬ 
netized plasma using the complex exponential 
discretization scheme [3]. Although this sys¬ 
tem is adequate for an initial model, it does 
not give an accurate amplitude and phase re¬ 
sponse at high frequencies. Thus in this paper, 
the conductivity tensor is discretized using the 
prewarped complex bilinear ^-transform, which 
requires no more computational overhead and 
yields more accurate solutions. 

As in the FDTD development [1], the ap¬ 
proach is validated by comparing the plane wave 
reflection and transmission coefficients of a slab 
calculated using the time-domain method and 
analysis [4]. 


Finally, the method is applied to the sim¬ 
ulation of propagation of low frequency waves 
in the ionosphere. The launch of a pulse from 
the ground and the high altitude dispersion of a 
pulse are studied. The results confirm that the 
dispersion of a pulse electromagnetic wave in a 
magnetized plasma leads to the formation of a 
whistler [5], 

1. THEORY 

In this section, the simulation of 1-D propaga¬ 
tion in a magnetized plasma using TLM is de¬ 
veloped. For further details of the approach and 
applications to other anisotropic materials, the 
reader is referred to [2]. 

If we consider one-dimensional propagation 
in x. with the static bias field also in x, 
in a non-dielectric, non-magnetic magnetized 
plasma, the conduction current is 


Jey 


' tjW 

of' 


E y - 

Jez 


.0? 

of J 


. E . 


The conductivity tensor is 


'of Of' 

_ Vp£ o 

S -r V c v b 

o^of 

~ (s + Vc ) 2 + W? 

~U b S + J/ e 


In (2), the electron plasma frequency cj p = 
yJN q 2 /(£Qm) y where N is the number den¬ 
sity of electrons, q is the electron charge and 
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m is the electron mass. Also in (2), the col¬ 
lision frequency is v c and the gyro-frequency 
<jj b = B 0 q/m where B 0 is the static bias field. 

The TLM representation of the electric con¬ 
ductivity tensor is [2] 

g?9l Z 1 deQVc \s + V c U b 1 { . 

.9? 9? J {s + v e y + u>l [-0*5 + 1*j K ) 

In (3), 9e o = v%£qT}qA£/v c where tj 0 is the 
impedance of free-space and A£ is the space- 
step. 

The elements of the conductivity tensor are 
second-order functions of frequency, and as de¬ 
scribed in [3], may be written as the real part of 
two complex first-order functions. Also because 
we are employing the bilinear ^-transform for 
the discretization, to obtain an accurate map¬ 
ping from the s to the Z plane, the functions in 
(3) are prewarped [6] using 



where C is the damping ratio, o>o is the res¬ 
onant frequency, At is the time-step, z/^ is 
the warped collision frequency and is the 
warped gyro-frequency. The prewarping opera¬ 
tion conserves the d.c. gain and the damping ra¬ 
tio of the second-order function in the transfor¬ 
mation to the Z-domain. The prewarped com¬ 
plex form of (3) is 



where denotes the real part of a complex 
function. The discrete model of (7) follows from 
the bilinear transform s -» (2/At)(l — z' ml )/(l + 
z” 1 ) and a partial fraction expansion to obtain 


In (8), the constant tensor is 


-*{*•[] 7 


r. _ ffeO Vc At(z/ cu , — jiObyj)/ {y c — ju>b) /, n \ 

p 2 +A ti^-jubv) {0) 

Also in (8), the frequency-dependent tensor is 


'Lgf . [*-: 

z \ K \l-z^p p [jl 


geOVcAtiVcv - 

P (x/ c -M)(2 + A 

and 

a _ 2 ~ Atfocw ~ juJbw) 

*“2 + A*(*,-**■) [6) 

Incorporation of (8) into the TLM model is de¬ 
tailed in reference [2]. The time-domain algo¬ 
rithm requires the storage of 2 real numbers per 
electric field component. 

2. VALIDATION 


In this section, the approach is tested by com¬ 
paring the plane wave transmission and reflec¬ 
tion performance of a magnetized plasma slab 
calculated using the time-domain method and 
analysis [4]. Following the example studied 
in FDTD [1], the slab depth was 9mm, the 
space-step A£ = 75/zm and the time-step was 
At = A £/c. The plasma properties were = 
2?r x 50 x 10 9 , v c — 2 x 10 10 and u b = 3 x 10 11 . 
The incident wave was a delta function, polar¬ 
ized in z. The transmission coefficients for cir¬ 
cularly polarized waves were obtained from 

Trcp {u) = Ey (a/) + jEt (uj) 

T LC p(v) = Efa) - jEt(w) 



The reflection coefficients were calculated using 
7 similar expressions involving the reflected fields. 




Figs. 1 and 2 show close agreement between 
the reflection coefficient magnitudes and phases 
calculated using analysis and TLM. Similarly, 
Fig. 3 shows a similar correlation for the trans¬ 
mission coefficient magnitudes. Figs. 4 and 5 
show the transmission coefficient phases. Al¬ 
though the transmission phases are not as accu¬ 
rate as the reflection phases, the results indicate 
that the prewarped model is yielding improved 
results c.f. [2], 



Figure 3: Transmission coefficient magnitudes 




Figure 1: Reflection coefficient magnitudes 


Figure 4: LCP transmission coefficient phase 




Figure 2: Reflection coefficient phases 


Figure 5: RCP transmission coefficient phase 
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Figure 6: Geomagnetic field line 

3. PROPAGATION IN THE 
IONOSPHERE 

In the earth’s ionosphere, the plasma frequency 
and the collision frequency are functions of the 
altitude above the ground. The gyro-frequency 
is a function of both altitude and geomagnetic 
latitude. In this section, the model is applied 
to the simulation of one-dimensional propaga¬ 
tion along a geomagnetic field line as indicated 
in Fig. 6. This represents whistler mode propa¬ 
gation [5]. 

3.1. Geomagnetic field line 

In Fig. 6, the mean earth radius Rq = 6371km 
and the geomagnetic latitude of the field line at 
the earth’s surface is <j> 0 . The geocentric radius 
at geomagnetic latitude <j> is [5], 

R — Rq cos 2 <t>/ COS 2 (j)Q (14) 


In (16), N c is the electron density, N 0 is the 
maximum electron density and x is the angle 
between the sun and the zenith. The reduced 
height is given by 


~ A) 
H{A) 


where the altitude above the ground A = R — 
Rq and the altitude of the maximum electron 
density with the sun overhead is Aq = 300km. 


Also in (17), H(A) is the local scale height [8], 
given by 


H(A) = 


kT(A) 

m{A)g{A) 


(18) 


where k is the Boltzmann constant, T(A) is the 
temperature, fh(A) is the mean molecular mass 
and g(A) is the acceleration due to gravity. To a 
first approximation, the temperature as a func¬ 
tion of altitude is, 


T(A<Ai) = To + hi A 

T(A>Ai) = T^-iToc- Ti)e- kh ^ A ^ (19) 

In (19), the altitude of the temperature inver¬ 
sion is Ai — 85km, the ground temperature 
To = 282K, the low altitude temperature gradi¬ 
ent ki — — 1.102K/km, the electron temperature 
at high altitudes is = 3750K, the inversion 
temperature T* = T 0 -r h A{ and the high alti¬ 
tude temperature coefficient kh = 0.01105/km. 

The mean molecular mass in (18) is assumed 
constant at fh = 3.97 x 10 -26 kg and the accel¬ 
eration due to gravity is 


The gyro-frequency is given by 

= V b , eq (^) A + 3sin 2 (15) 

where the gyro-frequency at the equator 
a > bjeq = 27r x 880 x 10 3 . 

3.2. Electron density 

To a first approximation, the electron density 
is assumed to be a simple Chapman layer [7], 

N c = iV 0 exp {o.5 (l - ^ - secxe"*)} (16) 

c ' 559 


g(A) = g Q ^1 -h — j (20) 

where go = 9.825m/s. 

For modelling the electron density at high al¬ 
titudes, (16) must be modified to take into ac¬ 
count the fall off in electron density due to the 
plasmapause [5]. This is approximated by mul¬ 
tiplying (16) with an exponential function, i.e., 


( 21 ) 








Figure 7: Electron density 
where A p = 3500km. 

By combining (16) through (21), assuming 
the sun is overhead, i.e. x = 0, the max¬ 
imum electron density No = 10 12 m~ 3 and 
evaluating the electron density as a function 
of altitude, the profile of Fig. 7 is obtained. 
The approximation obtained is of the same 
order as the measured results for low and high 
altitudes shown in reference [7], The plasma 
frequency follows from uj p = yjN q 2 /{eo m ) 
and the collision frequency is approximately 
v c ~ N x 7 x 10 -13 [5]. 

3.3. High altitude whistler 
The high altitude dispersion of a derivative 
Gaussian pulse is shown in Fig. 8. The 
model consisted of 8000 cells of space-step 
A£ ~ 0.278km covering 5° of the geomagnetic 
field line indicated in Fig. 6 with <j>o = 60°. 
The pulse was launched at <j> = 2.5° and the 
output was sampled for 60000 time-steps at 
<f> = 1.25°. The sample point was ~556km 
from the source point and the front of the 
pulse was observed after ^0.018s, indicating 
the speed of propagation of the pulse front of 
~ c/10. This result agrees with the calculation 
in reference [8]: At the geomagnetic equator, 
for 4> 0 = 60°, uj p ~ 85.7kHz and u b ~13.75kHz 
and the propagation speed of the whistler front 



Figure 8: High altitude whistler, time-domain 
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Figure 9: High altitude whistler, frequency- 
domain 

is c3y/3ub/(%Up) ~ c/10. 

Fig. 9 shows the frequency content of the 
whistler as a function of time extracted by tim¬ 
ing the zero-crossings. The figure indicates the 
expected frequency-dependent dispersion with 
the whistler nose frequency of 0.25u>& ~3.4kHz 
[ 8 ], 

3.4. Whistler launch 

In Fig. 10, the launch of a z-polarized Gaussian 
derivative pulse from the ground into the lower 
ionosphere is shown. Also indicated in the fig¬ 
ure is the shape of the electron density profile 
(c.f. Fig. 7). The model consisted of 8000 cells 







400 


CONCLUSIONS 



Figure 10: Whistler launch 



Figure 11: Reflection coefficient magnitude of 
the ionosphere 

using a space-step of M53.5m. The arrows on 
the curves indicate the direction of travel of the 
waves. Pulse (1) is the excitation observed af¬ 
ter 400 time-steps. Pulse (2) shows the reflected 
and transmitted waves at the lower boundary of 
the ionosphere after 3000 time-steps. Pulses (3) 
and (4) show the transmitted wave scaled up by 
a factor of 10 after 30000 and 100000 time-steps. 
In Fig. 11 the reflection coefficient magnitudes 
for circularly polarized waves are shown. The 
figure indicates that only the LCP mode pene¬ 
trates the ionosphere and the RCP mode is to¬ 
tally reflected. 


This paper has dealt with the time-domain 
modelling of one-dimensional propagation in a 
magnetized plasma. The discretization scheme 
was based on a prewarped complex bilinear. Z- 
transform. Results obtained from the time- 
domain model for the reflection and transmis¬ 
sion performance of a slab agreed closely with 
analysis. In the final part of the paper, the tech¬ 
nique was applied to the simulation of propaga¬ 
tion in the earth’s ionosphere. The results ob¬ 
tained confirm that the frequency dispersion of 
a pulse travelling along a geomagnetic field line 
leads to the formation of a whistler. 
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Abstract 

Patch antennas on magnetized ferrite substrate have useful properties like steerable beam direction, 
tunable resonance frequency and circular polarization. Conventional TLM (SCN) algorithm using 
open and short circuit stubs are not sufficient to model the magnetized ferrite substrate. A new 
TLM(SCN) algorithm has therefore been developed. General state equations are used to model the 
anisotropic and dispersive permeability of the magnetized ferrite substrate. Equivalent circuit for 
the magnetized ferrite substrate are first derived from the Polder permeability tensor and then 
transformed to state equations. The new TLM (SCN) algorithm is applied to compute the reflection 
coefficient S n and the radiation pattern of a sample patch antenna on magnetized ferrite substrate. 
The algorithm is numerically stable and computed results agree with measurement 


L INTRODUCTION 

Microstrip patch antennas are widely used in communication system due to their conformal geometry and 
low production cost. Microstrip patch antennas on magnetized ferrite substrate have the further 
advantages not provided by conventional dielectric substrates. Beam direction and resonance frequency 
can be changed by varying the strength of the dc bias magnetic field [1]. Circular polarization occurs in 
the fun dam ental mode [2]. A planar antenna simulation tool based on the method of moment cannot deal 
with ferrite in magnetized state because of the anisotropic tensor permeability. In the original symmetrical 
condensed node (SCN) formulation of the TLM (Transmission Line Matrix) method, the dielectric 
properties of the media are simulated by three open circuited stubs, one for each direction of propagation 
and the magnetic properties by three short circuited stubs. Losses are simulated by three terminated stubs. 
However the dispersion in the media cannot be simulated. In this new TLM algorithm the non-linear, 
dispersive and anisotropic properties of the media are modelled by state equations [3]. In the new 
TLM(SCN) algorithm, the permittivity of the media is modelled by shunt transmission lines and an one- 
port equivalent circuit in the x,y and z directions respectively. The permeability is modelled by series 
transmission lines and an one-port equivalent circuit in the x,y and z direction respectively. The equivalent 
circuit of magnetized ferrite for the new TLM algorithm are first derived from the Polder tensor [4] as 
shown in equation (1). 
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B x Mxx ^xy Mxz &x ^ j K o 

By ~ Byx ^yy /f yz 11 y ~ ~J K M 0 Hy 
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M = MO+Z 

rMojyHo + jcoa) 
{rH 0 + jcoa) 2 - co 2 
K -coyMq 

[yH o + jcoa) 2 - co 1 
y = Gyromagnetic ratio 
Hq = D.C. Magnetic field 


a = Dancing factor 
Mq = D.C. Magnetisation 


The equivalent circuits of the Polder tensor for the new TLM algorithm are derived and shown in Fig.l. 
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Hg.l Equivalent circuits for the permeability tensor of ferrite in TLM simulation 
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The circuit elements in Figure 1 are given by 


M 0 AL 

h= l 3 = l 5= l 6 = ~Hq~ ’ ^ = ^ = 


MqAL 

ml 


„ „ a 2 +1 

Q = C 3 = —5 - 

r 2 H 0 M 0 AL 

yMpALa 
- ^3 ” 9 * 

cr +1 


, C 2 = c 4 = 


g 2 + 1 
y MqAL 


n, », . M 0 AL » _ „ _ rM 0 AZ. 

* 2 ~^- 2tf 0 « ’ 5 6 ~ « 


Gyromagnetic resonance effect is modelled by the parallel LC circuits and damping factor are modelled 
by resistors. Couplings between x,y and z directed magnetic fields are modelled by controlled voltage and 
current sources from the above equivalent circuits in Fig. 1. A set of state equations with 12 state variables 
can be derived to model the anisotropic permeability. 
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Fig. 2 Equivalent circuits for the permittivity of ferrite in TLM simulation 


For the permittivity of ferrite the equivalent circuits used in the new TLM (SCN) algorithm are also 
derived and shown in Fig. 2. The elements in the equivalent circuits of Fig. 2 are given by 


Q) “ Qj “ C 7 - (f r 
Kj-RS-Rq- 

Where e r = Relative permittivity of the ferrite 
e 0 = Permittivity of air 
5 = Loss tangent of ferrite 
AL = TLM discretisation length 


564 





A set of state equations with 3 state variables are derived from the equivalent circuits in Fig. 2 . The 
overall state and output equations given in Equation (2). 


— =Ax+Bv r +B n f(x,v r ) (2) 

v l = Cx+ Dv r 

There are to tally 15 state variables required to model the permittivity and the tensor permeability of ferrite. 
In the overall state equations (2) the inputs are v r the reflected voltages at the nodes. There are 6 output 
variables which are the six incident voltages V at the nodes. The functions f{x , i/) are used to represent 
the non-linearities in the ferrite. The inputs to the state equations are the reflected waves from the TLM 
(SCN) nodes to the equivalent circuits and are calculated using Equation (3). The TLM scattering 
equation which decouples the added effects of the media is given by equation (4). The LHS is the overall 
reflected voltages from the TLM(SCN) node and is the sum of two terms. The first term contains is the 
reflections if the media is free space and the second term is the added contribution from the media. The 
second term can be calculated if the permittivity and permeability are known and can be modelled by a 
set of state equations or equivalent circuits. 
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The state equations are then solved numerically to obtain the outputs which are the incident waves from 
the equivalent circuits to the TLM(SCN) node. From these outputs of the state equations the total reflected 
voltage from the TLM(SCN) node can be calculated using Equation (4). 
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n. TLM SIMULATION RESULTS 

A 15mm long 0.7mm wide 500 microstrip line has been fabricated on a 1mm thick ferrite substrate. The 
ferrite medium is magnetized in the saturation region using a permanent magnet placed below and above 
the microstrip line sample. The simulation parameters are M 0 =0.035 Tesla , y=221000 rad m /sec A, 
Ho=234000 A/m, a=0.0056,8^=13.69,5=0.0002, AL=0.1mm, At=0.1667ps and mesh size=47 x700x30. 
The S 2i parameter has been measured using a vector network analyser. The TLM simulation results using 
a Gaussian excitation are shown in Fig. 3. The time domain results are transformed to the frequency 
domain using the EFT. The S 2l parameter from the TLM simulation and measurement are compared and 
are shown in Fig.4. The simulated gyromagnetic resonance is in good agreement with the measurement. 
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. ._-_. Kg. 4 The insertion loss S,, of the microstrip line on magnetized 

fig. 3 TLM simulation of the microstnp line on magnetized femte fenite substrate 

substrate " 


A microstrip patch antenna on fenite substrate as shown in Fig. 5 has also been fabricated. The simulation 
parameters are Mq= 0.035 Tesla,y=221000 radm /sec A, Hq= 234000 A/m, a=0.0056, s r =13.69,5=0.0002, 
AL=0.1mm, At=0.1667ps and mesh size=l 11x200x40. The simulated time domain results are shown in 
Fig. 6. The simulated and measured S u are shown in Fig. 7. The patch antenna has a single resonance 
frequency at 8.2 GHz without magnetization. The new resonant frequencies at 6.4,7.0 ,and 8.75 GHz 
generated by the magnetized ferrite are simulated by the new TLM algorithm and are in close agreement 
with the measured results. 



9mm 5 mm 


Height of ferrite substrate h= 1mm 
Fig. 5 Dimensions of the microstrip patch antenna 
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Fig. 6 TLM simulation of the microstrip patch antenna on magnetized 
ferrite substrate 



Fig. 7 The return loss S u of the microstrip patch antenna on 
magnetized ferrite substrate 


Far field radiation pattern of the antenna can be calculated by Fourier transform of the near field 
pattem[5], Applying the equivalence principle the tangential electric fields on six rectangular surfaces 
enclosing the radiating patch antenna can be used as the near fields. The radiation efficiency of the patch 
antenna in Fig. 5 near the resonant frequency is so low that the measurement of the radiation pattern is 
buried in noise. Another patch antenna with resonant frequency at 4.66GHz was fabricated. Radiation 
pattern of this patch antenna on magnetized ferrite substrate has been computed using the new TLM 
algorithm and is shown in Fig 8, For the polarization in the angle psi direction in spherical coordinate a 
good agreement with the measured radiation pattern can be seen up to 60 degree azimuth. For the 
polarization in the angle theta direction, the measured power is low and noisy. The simulated and the 
measured power in the angle theta direction are of the same order of magnitude. 



Fig. 8 Radiation pattern at 4.66 GHz of a patch antenna on 
magnetized ferrite substrate, angle psi = 0 degree 
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Simulation errors arise from the assumptions used in the formulation of the Polder tensor. The 
formulation assumes a completely saturated magnetization state, normal and uniform applied magnetic 
field and a constant damping factor at all frequencies. All the above assumptions are only approximations 
of the real magnetic properties of ferrite. Hysteresis loss is not included. The Polder tensor formulation 
of the ferrite permeability does not include the effect of the residue crystalline anisotropy field which cause 
gyromagnetic absorption at below 4GHz. Error in the measurement of the magnitude of the dc bias 
magnetic field causes differences in simulated and measured resonant frequencies. The space and time 
discretization used in the TLM method cause numerical dispersion and this also results in error in 
simulated resonant frequencies. Copper loss and the discontinuities at the coaxial to microstrip junction 
have not been included in the TLM simulation. Imperfect absorbing boundary condition used in TLM 
simulation can affect radiation pattern computed. All the above factors account for the differences in the 
simulated and measured results. 


EL CONCLUSIONS 

A new TLM (SCN) algorithm for ferrite media has been developed using state equations to model the 
media which is anisotropic, dispersive and non-linear. Time domain state equations models of the 
permeability of ferrite have been derived from the frequency domain Polder tensor. The new algorithm 
is numerically stable. The simulation results of a micro strip line and a patch antenna on magnetized ferrite 
medium in saturation region confirm the validity of the new algorithm. 
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Abstract 

In this paper we report on the practical use of matched layer (ML) and perfectly matched layer 
(PML) absorbers for simulating planar microwave circuits using the SCN-TLM method. It emerges, 
that a recently proposed PML-SCN node suffers from serious stability problems when applied to 
layered three-dimensional problems and its practical use is hence very limited. However, revisiting the 
well-known matched layer technique, it emerges that a parabolic conductivity profile is not the best 
choice to achieve optimum absorption in contrast to common belief. Rather, the optimum conductivity 
profile depends on the number of matched layers. In the cases of 5 and 10 matched layer, a linear 
conductivity profile gave the best results in terms of minimum return loss, whereas for 15 matched 
layer, a steep profile witch p = 2.4 gave the best results. Furthermore, we show that the main source 
of reflection in the matched layer does not come from parasitic numerical reflections due to the abrupt 
change in conductivity, but from the differing dispersion in neighbouring media with different media 
parameters. 


1 Introduction 

When simulating planar microwave circuits it is necessary to truncate the computational domain by 
absorbing boundary conditions to eliminate unphysical reflections from truncating the mesh. For this, 
a number of absorbing boundary conditions have been proposed and intensively studied [1,3,10,11]. 
Examples are boundary conditions obtained by discretizing one-way analytical conditions derived for the 
analytical wave equations [10] and Taylor’s expansion of the plane wave solution [3]. Another approach 
is the use of discrete Green’s functions [7], 

Recently, Berenger suggested truncating the computational domain by an artificial medium known 
as perfectly matched layer (PML), which achieves a much better absorption than previously suggested 
absorbing boundary conditions [2]. For the use with the TLM method with symmetrical condensed node, 
there have been different approaches of implementing the PML. The first reported use of PML in TLM 
was given by Eswarappa and Hoefer where a FDTD implementation of PML was interfaced with the 
TLM computational domain [6]. 

A modified SCN simulating PML media was proposed by Dubard and Pompei [4]. Although 
the authors report various applications to three-dimensional antenna problems, in view of the results 
presented in this paper, it seems that this TLM PML node is numerically extremely critical which 
limi ts its practical use tremendously, especially when the PML layers are supposed to truncated layered 
inhomogeneous media. 
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Later, another u n ified PML-SCN algorithms were proposed by Ney et al. [11]. However, when 
using uniform PML-SCN algorithms, the authors also report stability problems of their implementation 
which also limit the practical use of their PML implementation. 

For this, we revisited the well known matched layer technique as this Maxwellian absorber is 
absolutely stable as it consists of common symmetrical condensed nodes. Optimizing the conductivity 
profile of the matched layer yields sufficient absorption for calculating S-parameters in planar waveguide 
problems. It'emerges, that in the studied examples, a linear conductivity profile yields the minimum 
return loss in contrast to common belief. 

2 Implementation of Layered Absorbers 

Planar microwave circuits commonly involve complicated layered substrates. Examples are multichip- 
module interconnects or planar inductors on Si substrates [9,12]. Considering the cross-section of such 
realistic structures, one realizes that one has a very large number of regions with different media param¬ 
eters such as permittivity, permeability and conductivity. This large number of regions is of course also 
present at the terminating walls of the computational domain. If one wants to use layered absorbers for 
such structures, each of these regions requires its own matched conditions in normal direction. Although 
it is not absolutely impossible to derive the necessary parameters for the ML or PML by hand, practi¬ 
cally, it is necessary to automize the generation of the absorbing layers surrounding the computational 
domain. 

We have implemented an automatic generator for generating these surrounding layers. The general 
algorithm of this generator shall be sketched briefly. Particularly for using PML, one can generally 
distinguish three different types of regions, i.e. plates for uniaxial PML, edges for biaxial PML and comers 
for triaxial PML. Our generator works list oriented, i.e. for each type of possible PML region a separate 
list is administrated. This allows the flexible declaration of layered absorbers in any combination. In 
waveguide structures, it may be sufficient to use PML at the ports of the structure. In radiation problems, 
such as characterization of microstrip antennas, PML may be desired at all sides of the computational 
domain. Of course, this generator can also be used for specifying a matched layer ter minat ion of the 
computational domain. 

3 Stability Problems of the PML-Symmetrical Condensed Node 

Although the use of the recently introduced PML symmetrical condensed node has been demonstrated 
by simulating planar antenna structures, we could not reproduce these results due to serious stability 
problems. The stability issue seems to be more profound, as others have made the same observations [8]. 
We could perfectly demonstrate the function of the PML in two dimensional parallel plate test structures. 
However, in three-dimensional problems, instabilities corrupted the solution after a few hundred time 
steps. To illustrate this, a coplanar waveguide with uniform mesh using cubic cells was terminated with 
five layers of PML, using the same parameters as Dubard et al. [5]. The geometry of the waveguide is 
depicted in Fig. 1 and the time-domain signals in Fig. 2. The other walls were terminated by simple 
matched load terminations as described in the next section. Although the PML seems to work at the 
beginning of the simulation, especially when compared to a reference solution using a very large domain 
and a waveguide truncated with matched load, after 1500 time steps the solution deteriorates. 

To probe further into the nature of these instabilities, we studied a single PML-SCN cell in a 
simple TLM test code were we could easily switch between single and double precision arithmetic. For 
this one of the ports was excited by a Gaussian pulse which is formally equivalent to a PML cell interfacing 
a larger computational domain. Using single precision, strange round-off errors can be observed as shown 
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Figure 1: Dimensions of test coplanar Figure 2: Coplanar waveguide terminated with PML. 

waveguide with uniform cubic mesh in 
pm. A l was chosen 2pm. 


in Fig. 3. However, using double precision, these round-off errors vanish, as indicated in Fig. 4. These 
results clearly indicate, that this implementation of the split field Berenger formulation of the PML is 
numerically rather problematic in case of the PML-SCN node. 

Simulating real three-dimensional problems using double precision arithmetic, the stability prob¬ 
lems still remained which indicates that not only round-off errors are responsible for the instabilities. 
Furthermore, we observed instability, when we excited four PML-SCN cells stacked as a square at the flat 
side. This indicates, that some unwanted modes in connection with round-off problems are responsible 
for the stability problems. 




Figure 3: Single PML cell with Gaussian exci- Figure 4: Single PML cell with Gaussian exci¬ 
tation using single precision arithmetic. tation using double precision arithmetic. 


4 Matched Load Technique for Irregularly Graded Mesh 

The least computational effort for modelling free space boundaries can be achieved by terminating 
transmission-lines of a TLM-SCN mesh touching the boundary plane with a reflection coefficient r e // = 0. 
Within a stub-loaded TLM mesh a local reflection coefficient ri oc ^ r e ff has to be used to achieve an 
effective reflection coefficient r e //. For isotropic materials a local reflection coefficient is determined 
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where 


rioc(r e ff,€r,fi r:{UiV,w)) = I^ + Te ff 
1 + ror eff 


( 1 ) 


r 0 = 




V^ +1 


with 


9\\ 6 (u,v,w) || Vi 

g _l € (u, u, w) JL 


( 2 ) 


The variable terms g |j and g± represent the nonnalized geometric dimension of the adjacent TLM-cell 
in the directions parallel (||) or perpendicular (jL) to the polarization of the wave amplitude Vj on 
the transmission-line to be terminated. This termination of free space boundaries is comparable with 
absorbing boundary conditions of first order. Though their absorption is relatively low, they exhibit two 
main advantages over higher order models. There is no additional memory requirement or computational 
effort. Using matched load terminations, a return loss of around -30dB can be achieved in waveguide 
problems as indicated in Fig. 6. Similar loss figures were reported in [11]. 


5 Optimizing the Matched Layer Technique 

In contrast to PML, a layered matched layer termination of the computational domain is unconditionally 
stable. For this, we studied the influence of various parameters, such as layer thickness, theoretical reflec¬ 
tion coefficient, and conductivity profile more thoroughly in order to determine the opt imum parameter 
combination. The theoretical reflection coefficient determines the maximum conductivity at the end of 
the ML. Following Ney et al. [11], we used the following conductivity profile 


<j(L) — , (3) 

where (7-max denotes the maximum matched layer electric conductivity at the outer layer N of the matched 
layer, L € 1,..., N denotes the index of the matched layer. The maximum conductivity is calculated 
according to 


Cmax — ~~ 


Ore,i(P + 1) ln(-Rp) 
2NAIZ 0 


(4) 


with Ro the theoretical reflection coefficient when the matched layer absorber is terminated by an electric 
wall and Zq the free space impedance. It emerged that using a constant a Te d = 1-0 resulted in conductivity 
values, which changed the dispersion in neighbouring layers with different media parameters such that 
a worse absorption was achieved in contrast to using lower conductivity values. A value of Ored = 0.1 
performed much better thus reducing the differences in dispersion between neighbouring layers. 

In layered media, the matched condition is as follows 


£ i fJ £* +1 


(5) 


To investigate the influence of the various parameters of the matched layer, a simple coplanar waveguide 
was studied using a uniform mesh with cubic cells. We studied the performance of the ML for two 
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Figure 5: Details of the test coplanar waveguide used for optimizing the matched layer absorber. All 
dimensions in mm. Cubic cells with Al = 0.125mm were used. The discretization is analogue to the cpw 
presented in [11]. 



Figure 6: Minimum return loss curves for 5, 10 and 15 layers ML compared to matched load termination 
for CPW on substrate with € = 9.9. For each ML thickness R$ = 1.0“ 4 yielded the best result. The 
profile coefficients are p = 1.0 for 5 and 10 ML and p — 2.4 for 15 ML. 
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Figure 13: Return loss for 10 ML with Rq — Figure 14: Return loss for 10 ML with Rq = 

1.0“ 6 for CPW on substrate with e r = 9.9. 1.0 -8 for CPW on substrate with e T = 9.9. 


different substrates, one with e T — 9.9 and the other with e r = 5.0. The details are shown in Fig. 5. All 
other open sides were again terminated by matched load terminations. 

We studied the influence of the conductivity profile, expressed by the exponential coefficient p in eqn. 3, 
on the absorption for three different layer thicknesses, i.e. five, ten, and 15 matched layers. In each case 
the profile coefficient is varied from p = 1.0 to p — 1.2,1.5,1.8,2.0,2.2,2.4. The maximum conductivity 
is in each case varied by setting Rq — 1.0“ 2 ,1.0 -4 ,1.0 -6 ,1.0"®. The ML was always terminated with a 
matched load described in the previous section. 

First, we want to compare the return loss curves for the CPW on a substrate with € = 9.9 obtained 
using 5 matched layers, i.e. Fig. 7 to 10. Surprisingly, the difference between curves with different profile 
coefficient is less pronounced as previously expected. Below 20GHz, a difference of only ifcldB can be 
recognized for different profiles. Interestingly, if Rq is chosen too low, i.e. ~ 1.0 -2 the return loss is 
only 5dB less than the one obtained by using only a matched load termination is achieved in the lower 
frequency region (see Fig.7). By inspecting Fig. 7 to 10, one can find the optimum combination of 
parameters of the ML with 5 layers as Rq = 1.0 -4 and p = 1.0 in Fig. 8. This parameter combination 
gives a uniform return loss of at least —40dB up to 20GHz. 



Figure 15: Return loss for 15 ML with Rq = 
1.0” 2 for CPW on substrate with s r = 9.9. 
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Figure 16: Return loss for 15 ML with Rq — 
1.0~ 4 for CPW on substrate with e T = 9.9. 












For comparison, the return loss for the ’best’ combination of parameters chosen using the criteria 
mentioned above, for each ML thickness is shown in Fig.6. The curve, using simple matched load 
termination is also plotted. 




Figure 17: Return loss for 15 ML with i2o = Figure 18: Return loss for 15 ML with Rq — 

1.0” 6 for CPW on substrate with e r = 9.9. 1.0~ 8 for CPW on substrate with e T = 9.9. 


Increasing the number of ML to 10 results in a return loss of almost — 50dB up to 10 GHz and 
—43dB up to 20 GHz for the optimum parameter combination Rq = 1.0~ 4 and p = 1.0, which can be 
found from Fig. 12. Again, if Rq is chosen too low, as depicted in Fig. 11, the return loss is about lOdB 
better then when using a matched load termination. Compared to using 5 matched layers, the influence 
of the profile coefficient p is more pronounced when looking at Fig. 12 to 14, especially for Rq = 1.0” 4 
where varying p results in a ±3dB change in return loss in the 0-10 GHz region. 

A similar picture presents itself when 15 layers ML are used. Comparing Fig. 15 to 18 indicates 
that only for Rq = 1.0” 4 the profile coefficient p has a pronounced influence on the return loss in the low 
frequency region. Again, the ’best’ return loss up to 20 GHz is achieved for i?o = 1.0 -4 . However, using 
a steeper profile with p — 2.4 gives the lowest overall return loss up to 20 GHz, which is almost below 
-57dB everywhere in this region. 

If we look at the return loss curves for the CPW on a substrate with approximately half the 
dielectric constant, i.e. e r — 5.0, Fig. 19 to 21, one immediately realizes that the more or less uniform 
low frequency absorption region extends up to approximately 40GHz. The absorption of the 5, 10, and 
15 ML was investigated for Rq — 1.0“ 4 and the previously mentioned values of p. Interestingly, for 5 
and 10 ML a linear profile, i.e. p = 1.0, yields the most uniform absorption, whereas in case of 15 ML a 
profile with p = 2.4 yields the most uniform absorption. 

In Fig. 22 to 24, the return losses for the CPWs on the two substrates with identical ML parameters 
are compared. It is evident, that the media parameters of the substrate have a distinct influence on the 
performance of the ML absorber. 

For the CPW with e r = 5.0 substrate, the absorption in the lower frequency region up to 40GHz 
is much better when compared to the return loss for the CPW on a substrate with e T = 9.9. In the latter 
case, the absorption in the upper frequency region is better. From this, it is clear that the differences in 
dispersion due to the different media parameters in neighbouring layered media have a stronger influence 
on the performance of the ML absorber as the parasitic reflections due to abrupt changes in conductivity. 
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Figure 19: Return loss for 5 ML with Rq = Figure 20: Return loss for 10 ML with Rq — 

1.0“ 4 for CPW on substrate with e T = 5.0. 1.0 -4 for CPW on substrate with e r — 5.0. 



Frequency [GHz] FVequency [GHz] 


Figure 21: Return loss for 15 ML with Rq = Figure 22: Comparison of return loss for 5 

1.0" 4 for CPW on substrate with e T = 5.0. ML with Rq = 1.0 -4 for CPW on substrates 

of e r — 5.0 and e r — 9.9. 



Frequency [GHz] Frequency [GHz] 


Figure 23: Comparison of return loss for 10 
ML with Rq — 1.0 -4 and p — 1.0 for CPW 
on substrates of e r = 5.0 and e T = 9.9. 
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Figure 24: Comparison of return loss for 15 
ML with Rq = 1.0 -4 and p — 2.4 for CPW 
on substrates of e r — 5.0 and e T = 9.9. 



















6 Conclusion 

In this paper we have investigated the performance of matched layer absorbers for truncating the com¬ 
putational domain in planar problems with layered substrates. The absorption of the ML was studied at 
the example of a coplanar waveguide on two different substrates with different permittivity. It was found 
that for 5 an 10 ML a linear conductivity profile yields the best absorption, whereas for 15 ML a profile 
coefficient of p — 2.4 yielded the most uniform absorption. Hence, one could say that the more ML are 
used the steeper the conductivity profile has to be. Moreover, we could show that in problems which 
involve layered media, the main source of reflection in the matched layer is due to different dispersion in 
neighbouring layers and not due to the abrupt change in conductivity, as previously thought. 

Furthermore, we found that a recently proposed PML TLM node suffers from serious numerical 
problems which inhibits its practical use in problems with layered media. 
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Abstract. In this paper -we investigate the RF behaviour of a microelectromechanical capacitive switch 
by the Transmission Line Matrix method (TLM). These microelectromechanical (MEMS) switches can 
be utilized for antenna and phased array applications for communication systems or automotive radar. 
The switch under study consists of a thin metallic membrane connected to the ground of a CPW line 
which - when actuated by a dc voltage - forms a virtual short with the center conductor of the CPW. It 
is demonstrated how the TLM algorithm can be applied for the calculation of the scattering parameters 
of both switching states.The crucial point for the analysis of those structures is the aspect ratio of the 
geometrical dimensions. Especially in the off-state, where the virtual short is realized by a 0.1 /im thin 
dielectric layer between the two electrodes, the discretization becomes problematically. Here the time 
domain analysis can solve this problem very elegantly by modeling the thin layer as a thick anisotropic 
medium with a very much higher dielectric constant in vertical direction. However, due to highly dispersive 
structure, a post-processing of time domain signal is necessary to avoid long calculation times and to 
suppress unwanted reflections at the not ideal absorbing boundary at the output planes. The signal is 
truncated suitably and then prolonged analytically by estimating the truncated part by a least square 
parameter estimation using AR models. The calculated results of a metal membrane switch shew very 
good agreement to results obtained by other numerical methods. 


1. Introduction 

Since the first demonstration of a MEMS switch in 1971 [1] vivid research activities in this area have 
started due to the increasing demand on miniaturized elements and circuits. Rapid advances in bulk 
and surface micromac hining techniques have enabled the fabrication of more and more complex three- 
dimensional components with great reliability. Different forms of MEMS switches in shunt and series 
configuration, developed mostly for phased array antenna applications, have been proposed recently 
[2, 3, 4, 5, 6]. 


Nitride film (0.1 pm) 



Buffer layer 


Figure 1 . Geometry of a MEMS capacitive switch in on -and off position 
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In comparison with other switches, realized by PIN diodes, e.g., MEMS capacitive switches possess a 
better isolation and insertion loss, they exhibit a great linearity and the power consumption is zero in the 
activated position. Certainly there are some drawbacks like the the switching speed or the comparatively 
high swit ching voltage. Since the fabrication of those switches is a demanding technological process, 
there is a great need for suitable design tools for a numerical characterization with reasonable effort 
combined with a great accuracy 

In Fig. 1 the basic design of a single-pole single-through Ka-band shunt switch in coplanar technique 
is depicted. Unfortunately these metal membrane switches possess some features which make a nu¬ 
merical description of the purely high-frequency behavior quite difficult, especially the calculation of 
insertion/isolation and return loss in the different switching positions. They exhibit a layered structure 
on a bulk high-resistive silicon substrate with some extremely thin layers. For this switch the thickness 
of the nitride film, which prevents sticking of the two electrodes in the off-state, is in the range of 0.1 pm. 
Also the SiC >2 buffer layer, the lower electrode and the metallic membrane are very thin compared to the 
height of the substrate, the transverse dimensions and the length of the connecting transmission lines. 
This geometry considerably affects the discretization and hence the required memory and calculation 
times for all numerical methods. 

In this contribution the TLM method is proposed for the numerical characterization of the RF behav¬ 
ior of this switch. The time domain method gives accurate results by using less memory than frequency 
domain methods. Employing an anisotropic model for the thin dielectric film avoids a too fine discretizar 
tion as necessary in other methods. A suitable post-processing of the time domain signal applying a 
least square parameter estimation prevents long calculation times. Here we use it for the numerical 
improvement of the results, whereas in [7] it has been utilized for analytic calculations. 

2. Analysis of different switching states 

2.1. The on-state. In the on-state, when the membrane is not actuated, the thin nitride film upon the 
lower electrode can be neglected, as the influence on the capacity of the bridge is marginal due to its 
small thickness. The TLM analysis is carried out using N x = 268, N y = 88 and N z = 90 nodes. The 
output ports are closed by matched impedance layers. 12000 time steps were chosen, since the Gaussian 
input pulse widens severely in the highly dispersive structure. Reflections occurring at the not ideal 
absorbing boundaries at the output ports can be suppressed in the time domain signal by filtering with 
simple window functions. The results are depicted in Fig. 3. and Fig. 4 and compared with numerical 
results obtained with the method of lines (MoL), working in frequency domain [8]. For the reflection 
coefficient a very good agreement can be observed. The slight difference in the tra nsm i ss ion factor can 
be explained by taking into account substrate and conductor loss in the MoL, which is more significant 
for low frequencies, and the radiation loss considered by TLM. However, the agreement is also quite good 
bearing in mind the low value of the insertion loss. 



! L \ 


Figure 2. Layout and longitu dinal section of the examined structure. The geometrical 
parameters of the CPW line are W — 60 /im, S ~ 45 pm, t — 2.5 pm, L — 60pm, 
hsub = 300 pm, £gi = 12.67 
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Figure 3. Return loss for the not actuated switch compared with numerical results 
obtained by the MoL 



FIGURE 4. Insertion loss for the switch in on-position 


2.2. The off-state. The analysis of the off-state is much more complicated for all numerical methods. 
The membrane is pulled down by applying an electrostatic force, forming a virtual short through the 
very thin nitride film. Here the thin film is considered by modeling the dielectric layer with the thickness 
td as anis otropic medium of thickness tm with a dielctric constant 


e T =diag{e Tx , e Ty , 


e Tz ) — diag 



e r , 



( 1 ) 
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Unfortunately the virtual short has a severe influence on the run of the time domain signal, which can 
be observed in Fig. 5. 



Figure 5. Incoming, reflected and transmitted time domain signals in the off-state 


A truncation of the the signal after 15000 time steps leads to a considerable error after transformation 
in frequency domain. This can be seen in Fig. 6. and Fig. 7 (TLM without prolongation), where 
the artificial limited bandwidth of the time domain signal causes an unphysical behavior in the lower 
frequency band and a violation of the power conservation. A higher number of timesteps enforces not 
only a considerably higher calculation time, but also a larger amount of required memory, as the length 
of the transmission lines has to be increased in order to avoid a superposition with the reflected signal. 
To overcome this problem, the truncated time domain signal is completed by an predicted behavior, 
calculated by a least square parameter estimation using AR models. 

2.3. ARMA and AR models for the S-parameter estimations. The using of the autoregressive- 
moving average (ARMA) and the autoregressive (AR) [9] models can be successfully applied for the 
linear prediction of a low-decayed impulse responses of HF devices. These models can be described for 
a m-channel process s[n], (n = 0... N) by the system of the difference equations 

dr v 

s[n] = - Y. A M B M •«[»-*] = -]£era ■ s[n -k] + u[n] (2) 

fc=l _ fc-0 _^ fc-i _ 

ARMA AR 

where A[fc], B[fc], B[fc] - m x m complex matrices of the coefficients of the models; d, r, p - orders of 
the models, u[n] - a m-channel input process. 

The matrix-shifting techniques as well as polynomial AR spectral estimation methods are usually used 
for the estimation of the complex coefficients of the models [10). So the least squares estimation method, 
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which minimizes the error between samples of a process and samples of the model, can be used for the 
determination of the linear prediction 1 x (p + 1) block matrix forward = (J, C^[l]... C f\p}) and 
backward C b [k] = (C 6 [p]...C 6 [l],I): 


C f - SS = (R'O... 0), C b ■ SS = (0... 0 R 6 ) (3) 

where 9?-^, 9£* are linear prediction error matrices and SS - for the case of the covariance approach - is 
the (p + 1) x (p -4-1) block matrix with elements 

SS[i,i] = s[n — »] • s r [n — j] (4) 

k=P+l 

For m = 1 the system of equations (3) can be significantly simplified 



where S - windowed, prewindowed or nonwindowed matrix of the process; c^, c b - the vectors of linear 
prediction coefficients. The estimations of the forward and backward poles z*, and the correspondent 
residues can be also easy made by factorization of the polynomials, which can be created from the 
parameters and c 6 , and by solving the linear equations 


(Z T Z)w = Z T (6) 

where Z - pxN Vandermonde matrix of the poles Zk- The accuracy of the models (2) depends on the 
values of d, r, p and should be optimized [9]. 

In contrast to [7] we use this method only to predict the future time response of the truncated signal. 
For this estimation only a part of the output signal is used, namely the oscillating part with decaying 
amlitude. The calculation of the prolongation is shown in Fig. 6 and Fig. 7. 



FIGURE 6. Prolongation of the reflected signal using a parameter estimation technique 
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FIGURE 7. Prolongation of the transmitted signal using parameter estimation technique method 

After a FFT of the signal into frequency domain, we observe a physically plausible run of the scattering 
parameters. In Fig. 8 it can be clearly seen, that the unphysical behavior for low frequencies has been 
corrected perfectly. Fig. 9 shows again a good correction of the behavior in the low frequency band. 
Here the comparison with the MoL can give here no final criterion for the judgement of the accuracy, as 
the MoL has to use approximate boundary conditions in the section of the membrane which can cause 
some deviations. A full discretization of the cross-section incorporating the thin layer would exceed the 
available computer memory. 



Figure 8. Return loss for the actuated switch compared with numerical results obtained 
by the MoL 
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Figure 9. Isolation of the actuated switch 
3. Conclusion 

It was shown how a MEMS capacitive switch can be analyzed effectively by TLM. The analysis of 
the on-state of the switch requires no additional means for the calculation of the scattering parameters. 
A very good consistency with results obtained by the method of lines can be stated for both switching 
states. Less memory and computation time was spent by obtaining the same accuracy. For the off- 
state a postprocessing of the time domain signal becomes necessary to avoid extreme calculation times. 
The reflection coefficient is corrected perfectly by predicting the truncated signal using a least square 
parameter estimation technique. The required amount of memory and computation time can be reduced 
drastically. 


References 

[1] K.E, Petersen, “Micromechanical Membrane Switches on Silicon”, IBM J. Res. Develop., vol. 23, pp. 376-385, July 
1971. 

[2] C.L. Goldsmith, Z. Yao, S. Eshelman, and D. Denniston, “Performance of Low-Loss RF MEMS Capacitive Switches”, 
IEEE Microwave Guided Wave Lett., vol. 8, pp. 269-271, Aug. 1998. 

[3] N.S. Barker and G.M. Rebeiz, “Distributed MEMS True-Time Delay Phase Shifters and Wide-Band Switches”, IEEE 
Trans. Microwave Theory Tech., vol. 46, pp. 1881-1890, Nov. 1998. 

[4] S. Pacheco, C.T.C. Nguyen, and L.P.B. Katehi, “Microelectromechanical Electrostatic K-Band Switches”, in IEEE 
MTT-S Int. Symp. Dig., Baltimore, USA, June 1998, pp. 1569-1572. 

[5] S. Chen, K. Suzuki, T. Marumoto, Y. Ara, andR. Iwata, “Design of Micromachined RF Switch”, in European Microwave 
Conference, Miinchen, Germany, Oct. 1999, pp. 49-52. 

[6] K.M. Strohm, C.N. Rheinfelder, A. Schurr, and J.-F. Luy, “SIMWIC Capacitive RF Switches”, in European Microwave 
Conference, Miinchen, Germany, Oct. 1999, pp. 411-414. 

[7] C. Eswarappa and W.J.R. Hoefer, “Fast S-Parameter Computation of a Microstrip Interdigital Filter Using TLM, 
Prony’s and Digital Filtering Techniques”, Journal of Numerical Modelling, vol. 9, pp. 237-248, 1996. 

[8] L. Vietzorreck, “Modeling of the Millimeter-Wave Behavior of MEMS Capacitive Switches”, in IEEE MTT-S Int. Symp. 
Dig., Anaheim, USA, June 1999, pp. 1685-1688. 

[9] S. Lawrence Marple Jr., Digital Spectral Analysis, Prentice Hall, Inc., New Jersey, USA, 1987. 

[10] M. Haardt, Efficient One-, Two- and Multidimensional Bigh-resolution Array Signal Processsing, Shaker Verlag, 
Aachen, 1997. 


586 





























Thin Wire Modeling with the TLMIE-Method 


S. Iindenmeier, C. Christopoulos*, P. Russer 


Technische Universitat Munchen, Lehrstuhl filr Hochffequenztechnik 
Arcisstrasse 21, D-80333, Munchen, Germany; phone: +49 89 289 23378 
fax: +49 89 289 23365, e-mail: lindenmr@hft.ei.tum.de 
* School of Electrical and Electronic Engineering, 

University of Nottingham, Nottingham, United Kingdom 


Abstract 

The hybrid Transmission line Matrix-Integral Equation method (TLMIE) is extended in order to in¬ 
clude the modeling of thin wire structures which are embedded into a TLM mesh. While the radiated 
field of the thin wire structures is modeled by the Integral Equation method, the incident field at the 
structures is modeled via the space discretizing TLM method. In this way it is possible to investigate 
the interaction of thin wire structures with an environment of non-uniform material using a relatively 
coarse TLM mesh compared to the size of the thin wire structure. 


Introduction 

Typical of many practical configurations in engineering is the presence of wire-like structures in the 
proximity of complex three-dimensional objects consisting of several materials. It is well known that 
integral equation (IE) methods [1,2] are well suited to the modeling of wire-like structures and that in 
contrast space discretizing methods like the FDTD method [3] and the TLM method [4,5] are well 
suited to modeling complex inhomogeneous objects. Methods have been developed for the imple¬ 
mentation of thin wire modeling with the TLM algorithm but in these methods the wires are restricted 
in their geometrical structure [6,7]. It is therefore desirable that the modeler has the option of using 
each method, the TLM method and the IE method in its best domain of application and combining it 
in a hybrid to tackle complex practical configurations. This type of hybridization was demonstrated in 
a series of publications [8,9,10,11] for the case of two or more remote regions which are modeled in 
detail using a space discretizing method, the coupling between these regions through free space being 
done using IE type formulations. 

In this paper the method is extended to include the modeling of arbitrary wire structures embedded in 
a region modeled by a space discretizing method. The benefits of this approach are that the wires can 
be modeled efficiently using an IE method without imposing the burden of modeling fine features on 
the space discretizing method which is modeling large volumetric features of the problem. In this way, 
several wires (eg. wire harnesses) may be modeled and embedded as pre-calculated elements in the 
global simulation. Also more intricate shapes such as spiral, helical antennas, twisted wires etc. can be 
pre-calculated and embedded in simulations without imposing significant computational burdens on 
the global simulation. 
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Theory 


In many EMC problems we have to investigate the electromagnetic coupling between wire structures 
and objects placed in a non-uniform environment consisting of several materials like dielectrics, lossy 
material and conductors. 




Fig.l a): Derivation of the radiated field at the boundary surface S B of the wire subregion; 
b): Excitation of the radiated field by equivalent sources on the surface Sb 

In order to obtain an accurate field modeling in the compound material we discretize the problem 
space including all the material into a discretized region where the field is modeled using the TLM 
method. Since the dimensions of the thin wire structures are far below the size of the TLM-elementary 
cells these structures can not be modeled accurately with the TLM method without employing special 
techniques. In order to achieve a highly accurate description of the wire structure in the TLM method 
we replace the wire structures by an equivalent source which is more extended in size than the thin 
wires and which is matched to the TLM mesh. This way we embed the wire in a wire subregion with 
the surface 5 B . The surface S B encloses the complete wire structure and is congruent with the bounda¬ 
ries of the ILM mesh cells. Applying the Schelkunoff Huygens surface equivalence theorem we de¬ 
rive the field at the surface Sb which yields an equivalent source to the wire radiation. 

In the complete problem space inside and outside Sb we distinguish between the incident field £* nc , 
ff nc from all sources located outside Sb and the radiated field £**, Ef from all sources located inside S B . 
As a reaction to the incident field and due to other possible sources the field £*, If is radiated from the 
wire (fig. la). We consider the surface 5 b which encloses the thin wire structures and where we define 
the field £ B , H B , which is a superposition of the incident field E B inc , H B nc and the radiated field E B , 
H b \ 


According to the Schelkunoff Huygens representation of the equivalence theorem, the tangential field 
components £ B r , Hb on the surface elements of the boundary surface S B represent equivalent surface 
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currents which are sources of radiation. According to this theorem the equivalent surface currents 
generate the radiated field E, If outside the surface 5 b which is equivalent to the radiated field of the 
thin wire structure. Inside the surface 5 b the equivalent surface currents generate no radiated field 
(fig.lb). This means that in the TLM method a source of the field E&, H 3 along the surface 5 b leads 
to the excitation of the field E, If outside the surface 5 b while inside the surface 5 b no field E, If is 
excited. 

The field distribution of the incident field is continuous and smooth and is not dependent to the wire 
structure. Hence the incident field in the time domain can be modeled by the TLM method in the 
whole problem region outside and inside the surface 5 B . 

The field Eb\ H 3 at the boundary surface 5 b we obtain from the current distribution on the wire 
structure via the dyadic Green’s function in the time domain. The current distribution on the wire 
structure we obtain via the dyadic Green’s function in the time domain, the boundary conditions at the 
wire surface and from the incident field £ inc , If nc at the wire surfaces. This is shown in the following 
section. 


Calculation of the current values on the wire structure 

On the conductor surfaces we define the unity vector« which points in the direction of the conductor. 
We define the position of a point on the conductor by its distance s from the end of the conductor. All 
points on the conductor are defined by the three dimensional coordinates r{J). 

Now we discretize the surfaces of the thin conductors along one dimension into wire segments with 
the index k. On each wire segment we define the current 4 and the tangential unity vector u The lo¬ 
cation of the wire segments is /*. The index i is the time index of the current values. We expand the 
current vectors by the basis functions b c : 


with 


k i 


b c (l l 0 >t tfj) 



for {l-l 0 )>\'A-t 0 )>\ 
else 


( 1 ) 

( 2 ) 


In every time step the actual current values 4j of the wire segments are derived via the Schelkunoff 
Huygens surface equivalence theorem [1,2], if we take into account the boundary conditions for the 
total field E c , He at the wire surface 5c. The following expression relates the current distribution 
along the conductor surfaces and the radiated field Ec , He on the conductor surfaces 5c according to 
the surface equivalence theorem [2]. 






dt 


\dt 


(3) 


R c —f c (0 ~ r c (O ^ c o 
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where 


R c = IRel = 1 rdlH 

(4) 

Ate = RJCo 

(5) 


If we insert eqs. (1) and (2) into eq. (3), we obtain the following approximation 

E r c (r(l k )JA,)=E' ck ,i =£ ]£>«/’u (6) 

/=1 1=0 


for the electric field values E'oc ,j which are defined in each time step t=jAt at the location r=r(4) of the 
wire segments k. The vectors Mkxt =(M X , M y , contain the coupling coefficient vectors of the 

current values I# for the field components (ZT X , Ef y , tfzfckij of the radiated field. 

At the surface Sc of the wire structure we obtain the total electric field E& He by superposition of the 
incident field E c mc , H c mc with the radiated field Ec\ He . The incident field at all the segments k of 
the wire structure is derived from the wave amplitudes of the TLM simulation by the mapping intro¬ 
duced in [4], 


Ea,j =E i c nc (r(l k ),jAt) = E(a c ,b c ) ; agj = (r(l k )JAt) = H(a c ,b c ) (7) 


The position of the sampling points of the electromagnetic field in the TLM mesh is different from the 
position of the sampling points on the wire structure. We calculate the incident field values at the 
sampling points of die wire structure by interpolating the incident field values of the neighbored sam¬ 
pling points of the TLM mesh. The total field at the surface of the n wire segments k is 




a Ck,j 


1=1 i =0 


(8) 


We define the unity vectors Uk, which are tangential to the wire segments k. From the boundary con¬ 
dition at the conducting surface we obtain 


E ck,r u k=o 


(9) 


and hence, with eq. (8) 


£ (M t ,,;•««* I U h-*CKI-k-t, £ 4,1 ( 10) 

1=1 [=1 i=o 

Equation (10) can be solved for the actual current values Ikj of the wire segments in every time step. 
This can be done without a matrix inversion since there is a time retardation between the different 
current values on the conductor segments [12]. This means, that the current values Ijj of the actual 
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time step j are only coupled with the field values E C kj of the same segment j=k. Hence Mk,ij= 0, if tek 
and if j=k. This yields for the actual current values 


Mlcjc,j‘ u k 


L tow •■*4. 


( 11 ) 


Calculation of the field excitation at the boundary surface 

Instead of modeling the thin wire structure with the TLM scheme we replace the thin wire structure by 
an equivalent source surface S B (fig.l). On this surface we impress the field E B , #b during the whole 
TLM simulation. We derive the field E Bi from the incident field E B mc , Hb im and from the field £* B , 
H * B which is radiated by the wire structure. 

£ B =JE B inc + £B ; H B =HiT + B r B (12) 


The incident field is modeled by the TLM method. In every time step we superimpose the radiated 
field E T B , Ef B on the boundary surfaces 5 b to the incident field. JS'b, /T b is derived from the current 
distribution along the conductor surfaces according to the surface equivalence theorem. We give 
numbers }i to all boundary surface elements. The field values Hbh of these surface elements are 
defined in the center of the surface elements. The indices j and i are the time indices at the destination 
points and at the source points respectively. At the boundary surface we obtain the radiated electro¬ 
magnetic field by 




i a 


Rdt £ dl 





<ir 

. r3 J 



(13) 


R=ra~rc(l'y^=R/c 0 



l+Af^- 

<k 


u'xR 


r 


*=T^-Tc t C 0 


(14) 


where 

R=\R\ = \r tr r e W)\ 

At = Rlc 0 

If we insert eqns. (1) and (2) into eqns. (13) and (14), we obtain the following approximation 


k=l 1=0 


(15) 

(16) 

(17) 
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( 18 ) 


Kj=t £w« 

/b=1 i=0 

where the matrices F and Q contain the coupling coefficients of the current values. 

The field values E B , H b at the ports on the boundary surfaces of the TLM-regions are mapped on the 
TLM wave amplitudes as and b B , which are defined at the transmission lines which are cut by the 
surface S B . 


a B +a r B with a B ,H r B ) ; 

b B =a% M +a r B with b r B =b\p r B ,H' B ); 

The TLM wave pulses with the amplitudes a r B and b r b are representing the source of the electromag¬ 
netic field which is radiated from the wire structure. In every time step the excitation a B and b B excites 
the total field outside the boundary surface 5 b while inside the surface only the incident field is ex¬ 
cited. 

Finally, in this method the following procedures are performed in every time step of the TLM simula¬ 
tion: In the whole problem space outside and inside the wire subregion the actual incident field is cal¬ 
culated by the TLM method. At the boundary suifaee the actual radiated field is calculated from the 
time history of the current distribution on the wire structure via the Schelkunoff Huygens theorem. 
After calculating the radiated field at the boundary surface in every time step, we can consider the 
radiated field values on the boundary surfaces as equivalent sources. Outside the wire subregion these 
equivalent sources excite the same electromagnetic field as the wire structure. The radiated field at 
the surface boundary is mapped onto the wave amplitudes of the TLM algorithm. These wave ampli¬ 
tudes are exciting the radiated field outside the wire subregion which is calculated by the TLM 
method, superposed to the incident field. Inside the wire subregion the equivalent sources are pro¬ 
ducing no electromagnetic field due to the surface equivalence theorem. Hence, inside the wire subre¬ 
gion only the incident field is modeled by the TLM method. Taking into account the actual incident 
field and the boundary conditions at the surface of the wire structure the actual current distribution on 
the wire structure is calculated via the Schelkunoff Huygens theorem from the former time history of 
the current distribution at the wire structure. 


=a(£r.fff); 

b. 2“ ) 


Conclusions 

We have presented the theoretical basis which allows a hybrid model of wires to be constructed in an 
environment consisting of complex composite bodies. The wire model is based on the Integral Equa¬ 
tion method while the rest of space is modeled by using time-domain space discretizing techniques. 
The two models are combined to form an integrated solution. The advantages of this approach are that 
the differential time-domain model is constructed without reference to the fine structure of the wires. 
An IE technique is used for the wires as it is better suited to this task. This development has the po¬ 
tential for describing efficiently complex wire structures which may be pre-calculated and inserted 
into large complex configurations. 
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Abstract. In this paper we highlight a number of factors that determine the speed of iterative time- 
discrete algorithms, exemplified by various TLM codes. One of the main determining factors of compu¬ 
tational code performance on state-of-the-art platforms is memory access and not the number of floating 
point operations per TLM node, as recently shown [3]. To show the influence of memory access, the 
performance of various TLM codes is compared on various platforms for the example of a microstrip 
via-hole connect and a simple cavity resonator. Conventionally optimized and memory-access optimized 
codes are compared. Further, strategies for optimizing memory access are discussed. In addition, the 
issue of software emulated floating point underflow handling in simulations is discussed. By exploiting 
memory acccess optimization strategies, one can achieve a speed-up of the code of up to 100%. However, 
the optimization strategies are somehow dependent upon the compiler. If memory access is suitably opti¬ 
mized, TLM codes show similar perfomance on Pentium based PCs and workstations independently of the 
employed operating system and compiler. The reason why the same code still runs faster on workstations 
than on PCs with similar clock speed is due to the higher memory bus clock frequency and more aggressive 
out-of-order execution and branch-line-prediction of the workstations. 


1. Introduction 

Conventional wisdom suggests that the performance of iterative field solving codes is mainly 
determined by the number of floating point operations per iteration. However, it was recently shown 
that on state-of-the-art platforms the limiting factor in TLM code performance is not the pure number 
of floating point operatons but memory access [3]. This is not only true for workstations but also for 
Pentium based PCs. In this paper, we present strategies to optimize code performance with respect to 
memory access. A platform independent measure for code performance is the number of processed TLM 
nodes per second. Using this measure, we compare the performance of various TLM codes on various 
platforms. Further, two canonical test problems are defined for evaluating the performance of TLM 
codes. 


2. The Role of Memory Access 

2.1. The Test Problems. To verify the findings of [3], a more complex test problem than used in [3] 
was specified. A microstrip via-hole transition on a layered substrate was considered. The structure 
is shown in Fig. 1. The structure was discretized using cubic cells of 1mm size. The E z -field below 
the microstrip was excited using a Gaussian transient. The voltage in the output microstrip line was 
monitored by computing the line integral over the E 2 -field at one point (see Fig. 1). At the bottom of 
the substrate, a perfectly conducting ground metallization was assumed. To be independent of particular 
implementations, the simplest possible boundary conditions were imposed. Only electric, magnetic or 
simple impedance boundaries were considered. The absorbing boundary conditions of the microstrip via- 
hole transition were realized by medium dependent matched loads at the boundaries of the computational 
domain. The simulation was run for 1000 time steps to ensure that the complete mesh contained non-zero 


594 







TLM pulses. This avoids overestimating the code performance, as zero multiplications are handled faster 
in the CPU than non-zero multiplications. 



FlGXJHE X. Layout of microstrip via-hole connect. All dimensions in mm. 



Figure 2. Three-dimensional view of TLM discretization obtained by tlm3dNG. 

To highlight the role of memory access, a small problem that fitted into level I or level II 
cache of the CPU was also considered. It consisted of a simple air-filled cavity resonator of size 
lOmmx 10mmx 10mm. This cavity was discretized using 10x10x10, 20x20x20, and 40 x 40 x40 cu¬ 
bic cells. Depending on the size of the CPU cache, only the 10x10x10 problem fitted into the cache, 
or on more advanced CPUs even the 20x20x20 problem fitted into the cache. For comparison, a finer 
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platform 

clock speed 
[MHz] 

RAM [MB] 

CPU 

Cache 

OS 

Memory 
Bus Freq. 

A 

HP C-160 



PA 8000 

1MB extern 

HP-UX 10.20 

120MHz 

B 

HP C-360 


500 

PA 8500 

1.5MB on chip 

HP-UX 10.20 

120MHz 

C 

PC 

350 

64 

Pentium II 

LI 32kB / L2 512kB 

Linux 

100MHz 

D 

PC 


64 

Pentium II 

LI 32kB / L2 512kB 

Windows 95 

100MHz 

E 

Notebook 

366 

128 

Pentium II 

LI 32kB / L2 512kB 

Windows 98 

100MHz 


Table 1. Data of the platforms employed for the performance study. OS stands for 
operating system. 


discretization using 40 x 40x 40 cells was also considered, to show the influence of the memory access on 
■ the performance, when the problem’s process size no longer fitted into any cache memory. 

2.2. Details of Platforms and Codes. The via-hole connect structure and the differently discretized 
cavities were run on various platforms involving various TLM codes and various operating systems. 
Thble 1 gives an overview. We compared two TLM codes, one TLM code developed at the Technische 
Universitat Miinchen, called tlm3dNG, and a commercially available code, namely MEFiSTo-3D™ from 
Faustus Scientific Corporation, Victoria, BC, Canada. The codes differ in that the former is written 
in Fortran and the latter in C++. However, for the scattering routines themselves, which is always 
procedural code, C++ is eventually C and the optimization strategies suggested in section 3 equally 
apply. Both codes use for the SCN scattering the formulation of the scattering matrix following [2], 
which requires 54 additions/subtractions and 12 multiplications per scattering event. Only 12 quantities 
need to be stored. Moreover, the MEFiSTo-3D™ program also allows using the general symmetrical 
condensed node (GSCN), which accounts for the constitutive parameters of the medium by using different 
impedances for the various arms of the symmetrical condensed node. A scattering with the GSCN requires 
72 additions/subtractions and 42 multiplications if all stubs are considered. It follows the formulation of 
Trenkic [5]. The advantage of having three additional degrees of freedom is gained at the expense of a 
slight reduction of speed by typically 30 percent. In addition we have investigated various optimization 
strategies for improving the tlm3dNG code. Details can be found in section 3. 

2.3. Results. To compare the performance of different TLM codes, we have defined a measure that is 
independent of the platform. We found that the number of scatter and connect events per second is 
particularly suitable and can be measured easily. To determine this number, we monitor the exact CPU 
time of the respective TLM process using system functions. This can easily be implemented in existing 
codes. As soon as the pure runtime of the TLM program is known, the performance in 1000 nodes per 
second (kilonodes/second) can be calculated. Simply multiply the number of nodes (TLM cells) used 
to discretize the structure by the number of time steps, then divide it by 1000 times the CPU runtime. 
The performance of the various codes when dealing with the microstrip via-hole transition on the various 
platforms is compiled in Table 2. The data for the cavity resonator, discretized using three different 
cell sizes, are shown in Tables 3 to 5. In all these tables, the first column indicates the platform, given 
in Table 1. The second column indicates the TLM code. The third column reports the performance 
values for the tlm3dNG and MEFiSTo-3D™ codes using conventional compiler optimization. In the 
fourth column, the performance values for tlm3dNG using conventional compiler optimization in con¬ 
nection with data prefetching are given, and in the fifth column the data for the tlm3dNG code using 
compiler optimization, data prefetching and implicitly processing the connect operation are given. The 
optimization strategies are described in the following section. 

Looking at the via-connect problem (Table 2) solved using the SCN, one realizes that before 
applying memory access optimization strategies, all codes show similar performance. The reason why 
the MEFiSTo-3D™ code is slightly faster, may be due to some built-in optimization features of the 
commercial Windows C++ compiler, or it may be attributed to the different structure of the TLM 
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Performance of TLM codes in kilonodes/sec for microstrip via-hole transition 


mesh size 
iterations 


120 x 120 x 35 

1000 




conventionally 

conventionally 

conventionally 

platform 

TLM-SCN 

code 

optimized 

optimized + 
data prefetching 

optimized + 
data prefetching + 
implicit 

connect operation 

A 

tlm3dNG 

668 

943 

1377 

B 

tlm3dNG 

786 

940 

1425 

C 

tlm3dNG 

675 

- 

1100 

D 

MEFiSTo-3D™ 

701 

_ 

_ 

E 

MEFiSTo-3D™ 


- 

- 


Table 2. Scattering performance of TLM codes for via-hole interconnect problem on 
various platforms and running under various operating systems, given in 1000 nodes per 
second [kilonodes/sec]. 


engines. However, when additional strategies for better memory access are used when writing the code, 
the impact is striking. Using data prefetching, the code can be made nearly 50% faster, and implicitly 
processing the connect operation produces a code nearly twice as fast! Implementing an implicit connect 
operation is thus an effective measure for accelerating the TLM algorithm. Comparing the speed of the 
tlm3dNG code achieved on workstations and PCs, one can recognize the influence of the system clock 
speed or memory bus frequency. The workstations are about 30% faster when simulating the via-connect 
problem. 

The same picture is generated when comparing the performance values for the cavity resonator. 
The performance values for the workstations A and B in Tables 3 and 4, show the influence of the cache 
memory of the CPU. The 10x10x10 cells problem fits into the cache of each CPU. The 20x20x20 cells 
problem, fits only into the cache of platform B which has a bigger cache than the others (see Table 1). 

Clearly, the process size of the 40x40x40 cells cavity problem does not fit into the cache of any 
machin e which is obvious from the performance data in Table 5. Comparing the values for this problem 
with the data for the via-hole connect structure in Table 2, one sees the influence of the homogeneity of the 
data. Having a single computational domain, as in case of the air-filled cavity, the impact of optimization 
of memory access is clearly evident. The fact that the tlm3dNG code runs faster on workstations for 
uniform data is due to the more aggressive out-of-order execution and branch-line-prediction on these 
platforms. 

3. Code Optimization Strategies 

3.1. Compile Time Optimization and Local Code Transformations. Most of the optimization 
strategies commonly applied by modem compilers focus on the throughput of operations on the core 
CPU. Local code rearrangements performed by compiler optimization exploit processor features like 
pipelined proces sing or improve cache hit rates on a local scale. However, these compiler optimizations 
cannot deal with the problem of high memory bandwidth required by the TLM algorithm. Memory 
access time as a determining factor for TLM code performance becomes clearly evident when looking at 
the speedup of more then 400 % achieved for cache size problems (Table 4). 

Therefore, code optimization needs to focus on memory access rather than the pure floating point 
performance. An optimization strategy addressing this issue is the so called ’data prefetching’ offered 
by some state-of-the-art compilers. Instead of minimizing the number of operations, this optimization 
concept inserts dummy instructions that try to preload memory contents accessed in upcoming iterations 










Performance of TLM codes in kilonodes/sec for cavity resonator 


mesh size 
iterations 


10 x 10 x 10 

10000 


platform 

TLM-SCN 

code 

conventionally 

optimized 

conventionally 
optimized + 
data prefetching 

conventionally 
optimized 4- 
data prefetching -f 
implicit 

connect operation 

A 

tlm3dNG 

1294 



B 

tlm3dNG 


2986 


C 

tlm3dNG 


- 

1852 

D 

MEFiSTo3D™ 

1667 

. 

_ 

E 

MEFiSTo-3D™ 


- 

_ 


Table 3. Scattering performance of TLM codes for cavity problem I on various platforms 
and running under various operating systems, given in 1000 nodes per second [kilo¬ 
nodes/sec]. 


Performance of TLM codes in kilonodes/sec for cavity resonator 


mesh size 
iterations 


20 x 20 x 20 

10000 


platform 

TLM-SCN 

code 

conventionally 

optimized 

conventionally 
optimized + 
data prefetching 

conventionally 
optimized + 
data prefetching + 
implicit 

connect operation 

A 

tlm3dNG 

897 



B 

tlm3dNG 

3474 

3738 

5138 

C 

tlm3dNG 

821 

- 

1470 

D 

MEFiSTo-3D™ 

1212 

- 


E 

MEFiSTo-3D™ 

808 

_ 



Table 4. Scattering performance of TLM codes for cavity problem II on various plat¬ 
forms and running under various operating systems, given in 1000 nodes per second [kilo¬ 
nodes/sec]. 


into cache memory in parallel to the ongoing calculations. The results given in Table 2 point out the 
significant improvements obtained from such optimization strategies and demonstrate the importance of 
memory access for TLM code performance. 


3.2. Global Code Reorganization. After applying all these well known optimization startegies, the 
resulting codes that follow the conventional TLM scheme with scatter (the notation according to [4] is 
used) 


and connect operation 


I h+i > — S\a k > 


\ ak +1 > = r | 6jt + i > 


( 1 ) 

( 2 ) 


spend more time executing the connect operation F than the computationally expensive scattering oper¬ 
ation S as listed in Table 6. This is again explained by memory access time. Scattering operates strictly 
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Performance of TLM codes in kilonodes/sec for cavity resonator 

mesh size 
iterations 


40 x 40 x 40 

1000 


platform 

TLM-SCN 

code 

conventionally 

optimized 

conventionally 
optimized -f* 
data prefetching 

conventionally 
optimized + 
data prefetching -f 
implicit 

connect operation 

A 

tlm3dNG 

721 

1131 

1653 

B 

tlm3dNG 

847 

1148 

1830 

C 

tlm3dNG 

761 

- 

1316 

D 

MEFiSTo-3D™ 

955 

- 

- 

E 

MEFiSTo-3D™ 


- 

- 


Table 5. Scattering performance of TLM codes for cavity problem III on various plat¬ 
forms and running under various operating systems, given in 1000 nodes per second [kilo- 
nodes/sec]. 


locally, resulting in a high cache hit rate, whereas the connect operation slows down the process due to 
accessing memory that is widespread over one-dimensional memory space. 

Further performance improvements with striking code speed-ups are only possible by reorganizing 
the underlying algorithm and changing its memory access scheme on a globed scale. One possible approach 
is the grouping of TLM operators of two subsequent time steps: 

|a* +2 >= r- s(r-s | a k >) (3) 

s* 

Defining distinct scattering operators S and S* for even and odd time steps allows to implicitly 
process the connection operation. The same principle needs to be applied to field mapping and boundary 
conditions, respectively. From a programming point of view this formulation results in a dual imple¬ 
mentation of every module with access to wave amplitudes a* differing in memory access but not in 
functionality. The boost in code performance gained from this extra implementation work is most obvi¬ 
ous for large problems as given in Table 2. Since memory access differs for the two scattering modules, 
we c anno t expect an equal execution time for S and S*. 

4. The Role of Software Emulated Floating Point Underflow Handling 

On most platforms, the CPU monitors floating point underflow, and in case of detection it switches 
to a software emulation with higher precision [1]. Obviously, this will slow down the whole process. This 
feature is typically switched on by default on most platforms. Normally, if one has a single process, 
this effect is not noticed. However, in case of distributed computing, the performance of the slowest 
process determines the performance of the whole distributed process. To illustrate this, the mesh of a 
discretized coplanar waveguide was divided into 10 slices. Each slice constituted a single process, and 
the relative scattering times of each process were measured. All tlm3dNG processes were interconnected 
using the Parallel Virtual Machine (PVM). The waveguide is schematically depicted in Fig. 3. In Fig. 4, 
the relative scattering times versus time step are plotted for each process. In the fourth slice, we can 
recognize a sudden increase in relative scattering time after approximately 50 time steps. The increase 
lasts for about 25 time steps and dies out afterwards. This increase in scattering time propagates in the 
form of a ’shock wave’ through all the slices along with the launched pulse. The explanation for this 
observation is as follows. The leading edge of the Gaussian field profile involves such low values of the 
TLM wave amplitudes that the CPU detects a floating point underflow. It switches to software emulation 
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mesh size 
iterations 


optimization 


Start-Up [sec] 
Excitation [sec] 
Output [sec] 
Boundaries [sec] 
Scat. I [sec] 
Scat. II [sec] 
Con. [sec] 


microstrip via-hole connect 


120 x 120 x 120 
1000 



Table 6. CPU runtime in sec. of program modules when simulating the via-hole 
connect with tlm3dNG for various optimization strategies on a HP 0360 workstation. 
For comparison, the pure scattering performance, i.e. the number of processed nodes per 
cpu time taken for the scatter and connect operation alone, is also given in Table 6. 


with enhanced accuracy and consequently, the scattering times strongly increase and the performance 
of the TLM code breaks down. As soon as the values of the pulses reach a certain level everywhere in 
the TLM mesh of each process, the numerical performance is as usual. This effect has a particularly 
strong impact on distributed processes, although it is common to all TLM computations. The software 
emulation of this floating point underflow has no influence on the accuracy of a TLM s i mulation and 
represents just an unwanted feature which slows down the field computation. Luckily, this feature can 
be easily switched off using compiler directives on most platforms. However, Intel Pentium processors 
seem to exhibit an inevitable preformance drop down to less than 50 % when operating at the edge of 
the defined floating point range, even if the floating point exception handling is switched off. This could 
not be observed on PCs running with AMD processors. 


5. Conclusion 

We have shown in this paper that the limiting factor in TLM code performance is indeed memory 
access. This was demonstrated by comparing the performance of various SCN-TLM codes on various 
commonly employed platforms when simulating two canonical problems. A realistic microstrip via- 
hole interco nne ct problem and a simple air-filled cavity resonator were considered. Various strategies 
for improving the data h andling in procedural code have been proposed and verified. Applying data 
prefetching strategies and implicitly processing the connect operation wherever possible, speeds up the 
code by up to 100%. If suitable optimization strategies are used, TLM codes perform nearly equally well 
on workstations and Pentium based PCs. The reason why the same code runs faster on workstations than 
on PCs with similar clock speed is due to the higher memory bus clock frequency and more aggressive 
out-of-order execution and branch-line-prediction of the workstations. It was also found that the best 
optimization strategy for a particular platform is slightly compiler dependent. Moreover, it has been 
shown that the operating system (Linux or Windows) has no influence on the performance of a code, 
if the correct optimization strategy is pursued. The influence of the floating point underflow exception 
handling, which is common to all CPUs, on the performance of TLM codes has also been indicated. 
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FIGURE 3. Schematic of coplanar 
waveguide computed by 10 inter¬ 
connected distributed slices. 
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Figure 4. Relative scattering 
performance of each slice of the dis¬ 
tributed TLM mesh. 
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Introduction 

Although many electronic design automation tools are available, these tools generally suffer from 
problems of electromagnetic field solvers related to accuracy, efficiency and speed. Many available 
solvers, for example, provide an EMI analysis that solves for electric and magnetic fields in time and 
space domain using only the coupled Maxwell curl equations rather than complete Maxwell formulation. 
These algorithms based on incomplete Maxwell equations have been shown to give acceptable field and 
extraction results for low to medium frequencies. However, remarkable numerical errors appear at higher, 
e.g. s Giga plus frequencies, and these errors increase with frequency. Numerical solutions at high 
frequencies, particularly at Giga Hertz level, diverge substantially from analytical solutions even fine 
meshes are used. This error is known as the "Gaussian Bug". For examples, in the paper "Numerical 
Errors in the Computation of Impedances by FDTD Method and Ways to Eliminate Them", IEEE Trans. 
Microwave & Guided Wave Letters, Vol. 5, No. 1, 1995, it showed examples of impedance extraction, 
e.g. for a coaxial line using the FD-TD method. The exact solution for the line impedance is known to be 
50 Ohms. As the frequency climbs, however, the computed impedance values deviate. At frequencies of 
substantially 40 Giga Hertz, the computed impedance deviates from the exact solution by about 20%, and 
at higher frequencies, the discrepancy becomes even larger. These EMI analysis tools, therefore, are not 
sufficiently accurate for use with deep sub-micron circuits (semiconductor or electronic design packages 
in which the conductors are .25 microns in width and smaller) and high-speed digital design. 

To eliminate these spurious solutions, prior art systems such as the FD-TD, FV-TD, and HPSS 
algorithms use special kinds of meshes in modeling. These methods require that the modeled circuit or 
electronics design package be divided into a very fine special mesh. For example, special "edge element" 
is used in conventional electromagnetic finite element methods. In calculations, only the tangential field 
components are included on the edge and the normal field components are ignored. Calculations 
performed using fine meshes require a significant amount of memory and also require significant 
computation time. Therefore, beside inaccuracy these prior art solutions are highly inefficient too. 
Furthermore, often the resultant coefficient matrix produced by these simulation engines may be non- 
symmetrical and non-positive-definite. Therefore, computing stability is problematic. Additionally, 
because the electromagnetic characteristics and performance of circuits and electronic design packages 
cannot be adequately simulated at high speeds, a lot of trial and error design cycles are required to 
produce each integrated circuit. Much as 60% of the silicon material is wasted in these trial-error 
processes. 

As semiconductor technology advances, the need to provide accurate EMI field solver becomes 
increasingly important. Since today high-speed integrated circuits comprising increasingly large numbers 
of transistors are being constructed with increasingly smaller width conductors. The Gaussian bugs 


* LSFEM ™: AAR patented technology 
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problem described above becomes increasingly important. Eliminating Gaussian bugs in the formulation 
level using appropriate methods will result significant computational profit in the EMI analysis. 

For many years, American Applied Research group has developed systematic R&D to identify 
and solve the Gaussian bug problem [1-2]. Now it is able to provide the new time domain and frequency 
domain EMI field solver prototype, LSFEM TD-3D in the design tool arena. In this paper, we report the 
main results obtained in forming this new EMI solver to date to show its power in electromagnetic 
compatibility computation at up to high frequencies. 


Basic Formulation for LSFEM TD-3D 

The new EMI field solver LSFEM TD-3D is based on the full first-order Maxwell's differential 
Equations (Faradays’ law. Ampere’s law. Gauss’ law (electric) and Gauss’ law (magnetic)) to simulate 
the electromagnetic field and performances of electronic design packages. The simulation engine works 
on a plurality of linear algebraic equations, which are derived from Maxwell's Equations. To derive the 
equations. Maxwell’s Equations are discretized in the time domain using appropriate numerical scheme, 
which provides an optimal approximation in time. A least square finite element method (LSFEM) is used 
to minimize the integral of the squares of the residuals over a meshed space domain, resulting in a set of 
linear algebraic equations. An iteration method is used to solve the set of algebraic equations determined 
above. Preferably, a parallel computing scheme is employed in the simulation to increase the speed of the 
overall system. Introducing parallel computing into the solution processes greatly increases the speed and 
efficiency of the overall system. 


Checking the Source Code System 

To check the correctness of the computing code, EMI examples are computed. They are firstly 
modeled to provide structural files or computer aided design (CAD) files. Then structural file 
representing the modeled example is "meshed” in the space domain by any preprocessor, using a node- 
based finite element discretization method. Preferably, the modeled circuit is "meshed" into a number of 
finite elements of varying shapes and sizes, wherein finer mesh is only employed in areas of greater 
interest. 


Input data files to the solver include problem controlling file, node file, element file, and 
boundary condition and electromagnetic load file, which could be time-dependent. In the element file, 
information regarding the shape of the element, the material of each element in the design, and the nodes 
defining each element is associated with each element. Materials used in this solver include anisotropic 
cases 


A user preferably selects the simulation module and design-test related initial and boundary 
conditions (excitations) for the simulation, time frames for the simulation, and other user-selectable 
information. The control module is provided for selecting among time domain, frequency domain and 
other simulation modules, and also to provide means for selecting time parameters. Then the EMI 
solution related to the defined boundary and initial value problems is established by using an iteration 
scheme. The output of the system preferably comprises data associating the magnetic and electric fields at 
each node in the modeled design for each time step over selected time period. Using this electromagnetic 
data base, various performances of the tested packages can be evaluated. 
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To accelerate the computing processes and enhance the efficiency, an adaptive mesh refinement 
procedure can be incorporated. To calculate a response using the iteration method, an initial guess is 
made as to the answer at former time step for each node. The difference between the results is retained as 
an error index, which can be used to change the size of the mesh locally. 

The field solver part of the code therefore provides a time domain analysis of the modeled circuit 
or electronic design package. The time domain analysis by LSFEM-TD 3D code can be performed in two 
dimensions 2D, and three dimensions 3D. Output from the simulation solver can be used to compute 
time-dependent electromagnetic interference (EMI) fields between electronic components; time- 
dependent or frequency dependent parasitic extraction for critical nets, VLSI circuits, and system-on-a- 
chip; time delays and waveform evolutions; crosstalk & crosscoupling between conductors or electronic 
elements in the modeled design; power loss evaluations; electromigration, etc. Based on the LSFEM 
computing method, the resulted coefficient matrix is shown to be always a symmetrical and positive- 
definite. Therefore, computing stability using this LSFEM TD-3D code is guaranteed as we can see in 
the tests. Furthermore, super computing efficiency in terms of speed and memory used in problem 
modeling is clearly achieved. 

Features of the results for various tests with the code including computing stability, convergence, 
correctness, accuracy, speed and efficiency will be briefly described in the following section of this paper. 


Brief Description of the Tests 

Test #1: Characteristic impedance extraction of a 50 Q coaxial line. Fig. 1 shows the results. 
Here the results are under use of a 10 x 10 sparse mesh. Fourier transforms are used to convert from time 
domain to frequency domain. In this case, the LSFEM code gives accuracy within 1 % for frequencies up 
to 40 Giga Hertz. While in the paper mentioned above in this paper, the author’s estimation for the mesh 
required to obtain accuracy within 1% is finer than 100 x 100. Therefore, we can obtain much better 
modeling power from LSFEM code to give EMI analysis. 

Test #2: Scattering by a circular cylinder. FE mesh is shown in Fig. 2(a). Comparison for the 
results regarding RCS is typically shown in Fig. 2(b). Here you can see the correctness of the LSFEM 
method and the code of LSFEM field solver. Fig. 3 shows the spurious solution given by two curl 
equations. In the near field, you can clearly see the area of spurious solution through comparison of both 
solutions by LSFEM and two curl equations. 

Test #3: Scattering by a 3D sphere. Fig. 4 shows part of the meshed area. Here we use 5400 
3D 8- node cubic elements and 6020 nodes to form the mesh. The code shows good computing stability, 
good convergence and good accuracy for thousands of time steps. Total scattering time for the signal 
analyzed reaches over 4166 pico-seconds. 


References 

1) . Bo-nan Jiang, The Least-Square Finite Element Method, Springer, 1998 

2) . C. Ouyang, Spurious Solutions in Computational Electromagnetics and the Least Square 

Finite Element Method - A Technical Review, AAR Report, No. 98-011, 1998 


Figures 


606 





Imaginary Part of tha Impedance 


Coaxial Line 



Coaxial Transmission Line: (a) Geometry (b) Field Profile (c) Construction 



10.0 20.0 30.0 

Frequency {GHz) 


55.00 


§. 50.00 

1 


10.0 20.0 30.0 

Frequency (GHz) 


Fig. I Characteristic impedance of 50 Cl coaxial line by LSFEM TD-3D 
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Fig. 3 Spurious solutions given by incomplete Maxwell formulation 
and the correct solution by LSFEM TD-3D code 



Fig. 4. FE mesh in computation for scattering by a sphere using LSFEM TD- 
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Abstract 

The characteristic-based time-domain method, developed in the computational fluid dynamics 
community for solving the Euler equations, is applied to the antenna radiation problem. Based 
on the principle of the characteristic-based algorithm, a governing equation in the cylindrical co¬ 
ordinate system is formulated directly to facilitate the analysis of body-of-revolution antennas, 
and also to achieve the exact Riemann problem. A finite difference scheme with the second-order 
accuracy in both time and space is constructed from the eigenvalue and eigenvector analysis of 
the derived governing equation. Rigorous boundary conditions for all the field components are 
formulated to improve the accuracy of the characteristic-based finite difference scheme. Numerical 
results demonstrate the validity and accuracy of the proposed technique. 

1 Introduction 

A variety of numerical methods have been extensively investigated to model antenna radiation 
problems. Basically, they can be divided into two classes: time-domain and frequency-domain 
methods. Between them, the time-domain method has received more attention recently over the 
frequency-domain counterpart. There are several reasons for its popularity. First, the parame¬ 
ters of interest to characterize an antenna generally change drastically with frequency. When the 
traditional frequency-domain methods are used for analysis, a set of algebraic equations must be 
solved repeatedly at many frequencies. This can be computationally expensive, and even become 
prohibitive, when an object of large electric size is considered. However, if the time-domain method 
is employed, one simulation will be sufficient to generate all the information in the frequency do¬ 
main. As a result, the time-domain method greatly facilitates the modeling of antennas, especially 
the wideband antennas. Another attractive feature of the time-domain method is that it solves 
the time-dependent Maxwell’s equations directly, and hence provides good physical insight into the 
radiation process. Furthermore, it removes the difficulty of handling material properties, which 
cannot be avoided easily by the frequency-domain methods. Although the time-domain method 
has to repeat the calculation for different incident angles when it deals with scattering problems, 
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this is not the case in antenna radiation problems because here the excitation is often fixed and 
the analysis usually requires one calculation. 

In computational electromagnetics, a popular time-domain method is the finite-difference time- 
domain (FDTD) method, which was developed by Yee in 1966 [1]. This method is a mid-point, 
leap-frog algorithm with second-order accuracy in both time and space domains. Since it is a 
central difference scheme, it is naturally stable. The inaccuracy induced by improper boundary 
conditions, the discretization error, and the round-off error will continuously propagate through 
the entire computational domain. Since an absorbing boundary must be introduced to truncate 
the computation domain so that the infinite space can be simulated, the accuracy and stability of 
the FDTD scheme are restricted. 

Maxwell’s equations in time domain constitute a hyperbolic partial differential system, which 
is a pure initial-value problem [2-3]. Therefore, the characteristic-based algorithm [4-9], which 
was developed in the computational fluid dynamics community for solving the Euler equations, is 
found to be equally effective for solving Maxwell’s equations in time domain. The basic approach 
of the characteristic-based method is to reduce the three-dimensional system of equations to an 
approximate Riemann problem in each spatial direction. The sequence of one-dimensional problem 
is then solved to obtain the solution to the original problem [4], The characteristic-based algorithm 
has several advantages over other time-domain methods. First, it utilizes the unique feature of 
the hyperbolic differential system, which is that the given initial values, together with any possible 
discontinuities, are continued along the characteristics [2-3]. Consequently, if one of the coordinates 
of the system equation is aligned with the direction of the wave propagation, E x H, the wave will 
continue to propagate along this direction, and hence no wave is reflected back. Therefore, it 
naturally eliminates the reflected wave from the truncated outer boundary. Second, if enforces the 
directional propagation of information for wave motion, because it conducts a detailed eigenvalue 
analysis. Forward differencing is adopted for the negative eigenvalues, and the backward one is used 
for the positive eigenvalues. This windward discretization provides a more robust stability than 
a central differencing scheme. Third, the governing equation can be easily cast into a generalized 
curvilinear coordinate system. It greatly accommodates the computation of electromagnetic fields 
around a complex scatterer. Another important feature of the characteristic-based method is that 
it can achieve a higher-order accuracy easily by using a higher-order interpolation or extrapolation 
scheme to construct the flux vector at the cell vertexes or interfaces. 

Due to the above advantages of the characteristic-based algorithm, it has been applied to 
solving electromagnetic problems in recent years [4-9]. From a survey of the literature, it is found 
that the major concern at present is the scattering problem, and there has been little application of 
the method to antenna problems. However, it should be more advantageous to use this time-domain 
method to deal with the radiation problem, because the position of the source is fixed and therefore 
it is not necessary to repeat the calculation for different incident angles. However, the application 
of the characteristic-based method to the antenna radiation problem is not straightforward if the 
high accuracy is required. In this paper, a dipole antenna and a cylindrical monopole antenna are 
analyzed using the characteristic-based algorithm. In order to model these two kinds of antenna, we 
first formulate a governing equation, which is feasible for the body-of-revolution (BOR) problem. 
With this governing equation, the exact Riemann problem is then achieved, which enhances the 
accuracy of the numerical scheme. Next, a detailed eigenvalue and eigenvector analysis is conducted 
on the governing equation. A finite difference scheme with second-order accuracy in both time and 
space is constructed. Rigorous boundary conditions for all the field components on the surface of 
the perfect electric conductor (PEC) boundary are derived to improve the accuracy. Numerical 
examples are given to validate the proposed technique. 
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2 Formulation 

If the characteristic-based algorithm is used directly for the analysis of a BOR antenna, the gov¬ 
erning equation with six unknowns in Cartesian coordinate needs to be cast into the cylindrical co¬ 
ordinate system. This coordinate transformation induces errors in the characteristic-based scheme, 
because it transforms the original problem to an approximate Riemann problem. It also introduces 
the artificial difficulty in the enforcement of the boundary condition on the surface of the antenna 
and on the outer boundary of the computational domain. In this paper, we start from Maxwell’s 
equations in the cylindrical coordinate system directly. By choosing unknowns wisely, the exact 
Riemann problem can be achieved. At the same time, the difficulty in implementing the boundary 
conditions can be removed. 


2.1 Governing equations 

For an antenna that is rotationally symmetric and excited by a rotationally symmetric source, the 
electromagnetic field is independent of the cylindrical coordinate <j>. Therefore, Maxwell’s equations 
can be expressed as two independent sets: one involves only the components D z , D p , (TM case), 
and the other involves only the components of B z , B p and (TE case). Taking the TM case as 
an example, the relevant Maxwell’s equations become 


dE p 

dz 


dE, 
dp 
_dH ± 

dz 

ldlpH^ 

P dp 


dH# 

dp ^ dt 


-£ dt +Jp 
dE, 

= e ~dT + Jz - 


Written in a flux vector form, the above equations become 


where 


and 


dV dF p dF z 
dt dp + dz 


U = 


= -J 





(i) 


( 2 ) 


( 3 ) 


( 4 ) 


F* = I pB^/p. 1, J = 

l pDp/e J 

Since the flux vectors F p and F z are homogeneous functions of degree one with respect to the 
dependent variable U, they can be expressed as the product of the dependent variables and the 
coefficient matrices, 


F p = AU, F z = B U 


where the coefficient matrices A and B are 


A = 


\ 0 

tj. 

) 0 0 
-i 0 0 



' 0 

0 

0 ' 

. B = 
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2.2 Eigenvalues and eigenvetors 

The characteristic-based algorithm conducts a detailed eigenvalue and eigenvector analysis. The 
equations in the flux vector form are correspondingly split according to the sign of the eigenval¬ 
ues. The forward difference is applied to compute the flux vectors associated with the negative 
eigenvalue, and the backward difference is adopted to evaluate the flux vectors identified with the 
positive eigenvalue. This process mimics the wave mechanism of information propagation, and 
hence provides a more robust stability than a central difference scheme. 

The eigenvalues of the coefficient matrices A and B are the same, which are given by 


A= 0,- 


l ’ y/JHe yfjie j 

It is evident that the eigenvalues are nothing but the speed of the positively and negatively propa¬ 
gating waves. 

The similarity matrices for diagonalization are constructed by using the eigenvectors as the 
column arrays as shown in the following equation 

r o ,/* -,/r 1 r i 0 o l 

V f* V nr nr 

Sa= 10 0 , S B = 0 -yr yfl (8) 

.01 1 J 0 1 1 

which leads to 

F„ = SaXSJ 1 U, F z = SbXSb'U. (9) 

Prom equations (7) and (8), it is obvious that the derived eigenvalues and eigenvectors are invariant 
with respect to the dependent variables U. As a result, the exact Riemann problem can be achieved 
with this formulation. 

2.3 Flux vector splitting 

The fundamental idea of the flux vector splitting is to process data according to the direction of 
wave propagation. The positively propagating wave is associated with the positive eigenvalue, and 
vice versa. As a consequence, the flux vectors F p and F z can be split as 

F p = A + U + A~V 

F Z = B + V + B~V (10) 


A second-order accurate windward differencing can be constructed to form difference operators 
according to the sign of the eigenvalues, 

AVi = [-3U* + 4Uj+i - U i+2 ]/2 
VU i = [3Uj - 4Uj_i + U<- 2 ]/2. 
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(14) 






Figure 1: Electric current at the fed point of a center-fed dipole antenna with a Gaussian time- 
dependent voltage source. 


The fractional-step method or the Runge-Kutta family of single-step multi-stage procedures can be 
employed to accomplish the time integration. In this paper, a single-step two-stage Runge-Kutta 
scheme is used to guarantee second-order accuracy in time. 

3 Numerical examples 

To demonstrate the validity of the proposed algorithm, a number of examples are considered here 
and the results are compared with other numerical or measured data. 

The first example is a linear, center-fed dipole antenna [10]. The ratio of the radius to the 
length of the antenna, r/L, is equal to 0.00674. It is excited by a Gaussian time-dependent voltage 
source which takes the form ofV = exp[— a 2 (t — t max ) 2 ], where a = 1.5 x 10 9 , t max — 1.43 x 10 -9 
sec. The voltage source is fed to the antenna through a gap having a width of L/20 at the center of 
the antenna. Based on the equivalence principle, the gap is filled with PEC and a surface magnetic 
current M$ is prescribed on the surface of the gap. This magnetic current is related to the voltage 
source by = V/d , where d denotes the width of the gap, that is, d = Lf 20. The outer boundary 
of the computational domain is placed at a distance of L away from the antenna along both the 
p and z directions. The calculated current at the fed point is shown in Fig. 1. By performing the 
Fourier transform on the source current and the excitation voltage, the input admittance of the 
antenna can be obtained, which is shown in Fig. 2. Clearly, the calculated result agrees well with 
the result from the time-domain integral method [10]. 

The second example considered here is a monopole antenna fed through an image plane from 
a coaxial transmission line [11], the geometry of which is shown in Fig. 3. The parameters to 
characterize the monopole antenna are the height h and the radii of the inner and outer conductors 
of the coaxial line, a and b. In this example, the ratio of b to a is 2.3, and that of h to a is 32.8. 
The antenna is excited by the incident TEM wave within the coaxial fine, which is given by 

Ep(t) = V'(t)/pln(b/a) 

= V i (t)/(2 V pr l ) (15) 
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(a) (b) 

Figure 2: The input admittance of a center-fed dipole antenna, (a) Conductance, (b) Susceptance. 



Figure 3: Geometry of a cylindrical monopole antenna fed through an image plane. 


where 77 = 601n(b/a), and 

v'(t) = exp(-t 2 /2r^). (16) 

The internal reflection of the antenna is characterized by the time constant, r a , which is equal to 
h/c. In order to compare with the measured data in [11], the ratio of r p to r a is chosen to be 0.161. 
To apply the compatibility condition, the outer boundary is placed at a distance of h away from the 
tip of the monopole and the outer edge of the coaxial line. The calculated reflected voltage in the 
coaxial line is shown in Fig. 4. The reference plane to extract the reflected voltage can be chosen 
arbitrarily within the coaxial line, as long as only the TEM mode is present. The compatibility 
condition is also implemented here to suppress the incoming wave. From Fig. 4, it can be seen 
clearly that the numerical result agrees very well with the measured data. 

Next, the input admittance of the monopole antenna is calculated. To compare with the result 
measured by Cooper [11], the ratio of 6 to a is chosen to be 3.0. The Gaussian pulse in the form 
of (19) is still employed to excite the antenna. After extracting the reflected voltage within the 
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Figure 4: Reflected voltage in the coaxial line for a cylindrical monopole antenna excited by a 1-V 
Gaussian pulse: b/a = 2.3, h/a = 32.8, and r p /r a — 0.161. 



Figure 5: Input admittance of a cylindrical monopole antenna: b/a = 3.0. 
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coaxial line, a Fourier transform is performed on it as well as on the incident voltage to obtain the 
reflection coefficient at the reference plane. From the reflection coefficient, the input admittance 
at the aperture of the monopole antenna can be obtained. The result is shown in Fig. 5, which 
exhibits a good agreement with the measured value. 

The method has also been applied to dielectric coated antennas successfully. 

4 Conclusion 

In this paper, a characteristic-based time-domain scheme is presented for antenna analysis. In 
combination with the unique feature of the BOR antenna, a governing equation is derived directly in 
the cylindrical coordinate system. It transforms the original initial and boundary value problem into 
an exact Riemann problem, and hence improves the accuracy of the characteristic-based scheme in a 
non-Cartesian coordinate system. It also facilities the enforcement of the PEC boundary conditions 
and compatibility condition. Numerical results are shown to demonstrate its effectiveness. 
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Abstract 

Solving of the electric field integral equation (EFIE) using the method of moments (MoM) 
is a powerful tool in determining the time dependent current distribution on thin-wire struc¬ 
tures. Modeling real life problems however requires consideration of the finite conductivity 
of the wires, consideration of lumped and distributed loads and the computation of the elec¬ 
tromagnetic field in the vicinity of the structures. In this paper we present an extension of a 
time domain formulation for solving the EFIE for thin wires with regard to linear and non 
linear loads and scattered field computation. 


1 Introduction 

In [1] we presented a new numerical formulation to determine the time dependent current distribu¬ 
tion on thin-wire structures excited by an arbitrary time varying field. The numerical formulation 
is based on the solution of an equation system formed by the mixed potential EFIE and continu¬ 
ity equation for thin wires. To solve this equation system we use the method of moments. The 
related current and linear charge density distributions are approximated by means of sets of basis 
functions. For the currents we use linear basis functions of triangular shape, and for the linear 
charge densities we use pulse basis functions. At wire junctions the current basis functions are 
slightly modified. They have to enable modeling of multiple connected wires without affecting the 
late-time stability of the formulation. The chosen current and charge approximations are similar 
to those in [2], which led to very good results in frequency domain. We also perform linear time 
discretization by using finite difference formulae. 

For computation of scattered and radiated fields we make use of the electric scalar potential 
and magnetic vector potential. We compute both at suitable locations and differentiate them 
numerically with regard to space and time. 

Considering loads, the EFIE used has to be modified. As long as the loads are passive elements 
the resulting matrix has to be modified only at elements of its principal diagonal. With linear 
loads the modified matrix is furthermore invariant with respect to time and has to be built up 
only once. It can then be used at every time step. With nonlinear loads matrix elements may 
change at each time step and special techniques or properties of the equation system have to be 
used in order to reduce computation time. 


619 





2 Numerical formulation 


Assuming a perfectly conducting thin wire in a homogeneous and lossless medium with constant 
permittivity and permeability, the EFIE and continuity equation can be written as 


dA 

9T + V$ 


-EL 


dl d\ 


ds + 


dt 


= 0 , 


( 1 ) 


where A and are the retarded magnetic vector and electric scalar potentials. I and A are the 
induced electric current and linear charge density on the wire, s denotes the length variable along 
the wire. As excitation we use the electric field E 1 of an incident electromagnetic wave. 

As reported in [1] we use the method of moments to solve (1). We approximate the current 
distribution with a set of triangular basis functions f n and the linear charge density distribution 
with a set of pulse basis functions g m : 


N N+l 

I(s,t) = 53 J„(f )/»»(»)» A(s,i) = 53 . (2) 

H-* X y?i— 1 


As testing functions we use delta functions for the continuity equation and pulse basis functions 
in case of the EFIE. Applying the method of moments and replacing time and space derivatives 
with finite difference formulae yield 

— A — w " = 2 + g * +1 ) » At = ~ F ™’ (3) 

where F .* is an expression for the current divergence at time step k and G n is a function of the 
electric scalar potential and the incident electric field. \p£ +1 = A£ +1 * As contains the magnetic 
vector potential which is dependent on the electric current. The indices m, n correspond with the 
testing points (MoM) for the two equations. 

To explain the Marching-On-in-Time (M-O-T) algorithm we assume that all quantities are 
known for time steps 0 < k' < k and we now want to know the values J* +1 , A** 1 . With the 
continuity equation part of (3) new charge density values A^ +1 are computed. They lead to 
the new values G„ +1 whereas the are known from the time step before. We decompose the 
terms '$ r £ +1 in a “self’ contribution that depends linearly on I k+l values, and a “non-self 5 

contribution which can be computed from known values I k '. Thus we obtain the equation 

system 


jyj jk+l __ _^ns,fc+l 


(4) 


with the currents I k+1 as unknowns. Matrix M is sparse, diagonally dominant and contains 
mainly geometry information. Furthermore it is time invariant and needs to be set up only once 
at the beginning of the simulation. It can then be used again at every time step and leads to a 
considerable saving of computation time. 

To ensure late-time stability we introduced a corrector step [1], in which the linear charge 
density X k+1 and current I k+1 values are re-computed. 
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3 Field computation 


The electromagnetic field in the vicinity of conducting structures is given by the superposition 
of the incident field and the scattered field. While we regard the incident field as excitation, 
and thus as known, we have to compute the scattered field. We do this by using the retarded 
electromagnetic potentials: 

dA 

E s = - V$ - — , B s = V x A. (5) 

Now consider a wire located on the 2 axis. At four points, which we obtain through small 
translations of the observation point in radial and axial direction, and at the observation point 
itself we compute the values for $ and A at every time step. The scattered field contributions of 
the two potentials can be specified as 


V# 


$2 - $1 ( $4 - $3 

-1- * e r H-7- 

A r Az 


Vx A 



* Q<p j 


dA 

dt 


A k+1 - A* 
At 


( 6 ) 


where the derivatives with respect to space and time have been replaced by finite difference 
formulae. 

Using the retarded potentials directly we omit any far field approximations and are able 
to compute the electromagnetic field even at observation points very close to the conducting 
structure. 


4 Wire junctions 

In [1] we used current basis functions of type (I) from Fig. 1 for wire junctions. For each wire end 
at the junction point such a basis function has to be used. As* denote the segment lengths on the 
connected wires and A U = Asj/2. 




Figure 1: Basis functions for wire junctions 

The basis functions of type (I) give good results as long as the number of connected wires is 
not to high (more than five). At higher numbers stability problems occur. With the modified 
basis functions of type (II) from Fig. 1 we have eliminated these stability problems. The difference 
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between the two types of basis functions axe the heights of the current parts at the junction point. 
They axe given by 



72,* 


( 7 ) 


where rij is the number of connected wires at this wire junction. The basis functions of type (II) 
were first introduced in [3] for frequency-domain simulations. They ensure that Kirchhoff’s law, 
one of the conditions for wire junctions, is also satisfied for the single basis function, whereas the 
basis functions of type (I) fulfill Kirchhoff’s law only when we sum up the contributions of all 
basis functions adjoining this junction point. 


5 Modeling linear loads 

Modeling loads requires a modification of the EFIE. The additional voltage along the load has to 
be considered in the numerical model. The continuity equation is not affected by the load. The 
new EFIE can be written as 




In eq. (8) we regard the load as a lumped element that can be represented as a series connection 
of resistance, inductance and capacitance. To treat (8) numerically we have to perform similar 
transformations of the EFIE as those, which led from eq. (1) to eq. (3). In particular we have to 
discretize (8) with respect to time. We use a simple forward differentiation formula instead of the 
time derivative, that comes with the inductance 

QT k jk+i jk 

L i\* L -KT-’ ® 

and take advantage of the piecewise linear time history of the current: 

(*+l)At 

i J + -/*)]. (10) 

kAt 

In eq. (9) and (10) we also separate terms that depend on not retarded (not known) currents I k+1 
from those that depend on retarded (already known) currents I k> , 0 < k! < k. Hence we obtain a 
new equation system 

M + AtR + L+^cJ 7* +1 = «r* +1 - «r ns ’* +1 (11) 


with the currents I k+1 as unknowns. R, L, and C represent matrices, that contain non-zero 
elements Ra, La , 1 /C# only at positions of their principal diagonal that correspond to places 




where load elements were located. In the geometry model the loads are positioned at the testing 
points for the EFIE. As long as the R, L , C values are linear, i.e. time invariant and as long as we 
keep At = constt the new system matrix is also time invariant. It needs to be built up only once 
and can be used at every following time step. The right hand side of the equation system has also 
changed. Besides the contributions due to the retarded potentials it contains also contributions 
related to the load elements. 

We can imagine distributed loads or wires with finite conductivity as series connections of 
several lumped loads and model them in the same way as described above. 


6 Modeling nonlinear loads 


Modeling nonlinear loads is similar to modeling linear loads. The same formalism as described in 
section 5 is applied for modifying the EFIE. Differences occur when solving the equation system 
for the currents (4) and (11) respectively. The formalism leads to matrix elements, that may 
change at each time step. The most straight forward method would be to adjust the system 
matrix at each time step and use the usual solution scheme for the equation system. But this 
would require to find the inverse of the system matrix (or perform a LU decomposition of the 
matrix) at each time step. Also the amount of storage required for the matrix is twice (original 
matrix and inverse or LU decomposition). To overcome this increased requirements regarding 
computation time and storage special schemes for nonlinear loads have been developed. A good 
overview is given in [4] and [5]. 

At this stage we will consider only a single wire antenna loaded with a nonlinear load having 
an idealized diode-like characteristic. Let Vnl be the voltage along the nonlinear load. Its value 
is then given by 


p _ j R f » Vivz > 0 > 

" L [Rr, if V NL <0. 


( 12 ) 


Since we consider only a single wire, our formulation yields for eq. (4) a matrix M that consists 
only of its principal diagonal. This is not the case with the procedure described in [4] and [5]. 
Thus we do not need a real matrix inversion or LU decomposition. We can therefore afford the 
“straight forward” method described above. Off-diagonal elements occur when wire junctions are 
involved or when the distance between two wires is smaller than c- At, where At is the time step 
used and c the velocity of light. Since even in these cases matrix M is usually sparce, a resorting 
of the matrix can yield a block structure 


( M u 


M = 


M22 


M N n ) 


(13) 


where only the M^-Blocks have non-zero elements. Adjusting the matrix like in the “straightfor¬ 
ward” method would be reduced to adjusting those blocks that contain nonlinear loads. In most 
of the cases this will result in a noticeable saving of computation time. 
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7 Numerical results 

(1) With a first example (Fig. 2) we want to demonstrate the efficiency of the chosen basis 
functions for wire junctions. The structure consists of 26 wires of 0.5 m length and 1 mm radius 
each. For every wire 19 current basis functions were used. The structure is illuminated by an 
incident electromagnetic wave whose time history is given by a single puls of the form 

E l = Eosin 2 (irt/T ), ft = lV/m ( T = 2 ns. (14) 

The current in Fig. 2 shows that the results are stable for late times. It is the electric current in 
the middle of the wire marked with C. For validation, time domain (TD) results are compared 
with frequency domain (FD) results provided by Concept (see [2]) by means of an inverse Fourier 
transform of the FD values. 




Figure 2: Wire junction with 26 wires; Current at point C. 


Fig. 3 shows the E x and H y component of the resulting electromagnetic field at point F in 
Fig. 2. While the displayed E x component is a pure scattered field component the contribution of 
the incident wave can clearly be seen at the first negative peak of the H y component. Here again 
TD results are validated by comparison with FD results. 

(2) We now consider modeling of linear loads. In Fig. 4 a R-L-C circuit is fed by a voltage 
source with the temporal behavior of a step function. The rise time of the source is t r = 1.5* 10~ 10 s. 
For the resulting current a solution can be found analytically. It is given by 

w = Jw+dz) ■ ^ 

For the numerical model we use four wires of 2 cm length. Each wire has a radius of 0.2 mm 
and was discretized using 9 segments. The voltage source is placed in the middle of wire 1 and 
the three load elements R , L , and C as a single lumped load in the middle of wire 3. 

Modeling the load we have to consider that the loop has a self inductance L s = 64.87 nH and 
therefore the load inductance used must be L\ = 1 — L s ~ 0.935 //H. The value for L s results 

from an approximation formula for the self inductance of rectangular loops [6j. 
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Figure 3: Electric and magnetic field at point F in Fig. 2. 


Fig. 4 shows the current through the lumped load. As can be seen, numerical results (TD) 
match analytical results very well. The slight oscillations in the vicinity of the first positive peak 
are due to a traveling wave between the source voltage and the lumped load. 

(3) As an example for nonlinear loads we reproduced results presented in [4]. A dipole is 
loaded at its center with a nonlinear resistor. The resistance is represented by Rf = 50 £1 in the 
forward” direction and Rj. = 5000 Q. in the “reverse” direction. 

A voltage source is applied in series with the nonlinear element. The form of the voltage is 
V(t) = IV*where g = 1.73• 10 9 s~ l and = 1-76• 10~ 9 s. To match the results given in 
[4] a dipole length of L = 1 m and a radius of a — 6.739 ♦ 10" 3 m were chosen. With this data the 
the dipole fatness parameter is found to be Q = 2 In {L/a) = 10. 29 segments were used to model 
the antenna. Fig. 5 shows the current at the center of the dipole and the broadside radiated field 
according to that current. Similar to [4] the field has been “normalized” by the radial distance 
from the dipole to the observation point, i.e. the values for E z have been multiplied with the radial 
distance. The results of Fig. 5 are essentially the same as presented in [4]. 


Uq = 10 v\ ® 


[jj? = ioon 

■£ — 1 /iH 
-L C = 10 P F 


_x 
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Figure 4: R — L — C Circuit 
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Figure 5: Current and radiated field of an antenna with a nonlinear load 


8 Conclusions 

The numerical technique developed in [1] has been improved with regard to modeling of wire 
junctions. Extensions for scattered-field computation and modeling of loads have been added. 
For the latter a modification of the used EFIE was necessary but did not affect late-time stability. 
The results presented have shown that our formulation is suitable for modeling general thin-wire 
structures. In case of nonlinear loads some extensions have yet to be made. Several other types 
of nonlinearities have to be added and computation time saving techniques for solving the current 
equation system have to be implemented. 
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Abstract 

A simple and efficient way to analyze the transient response of thin-wire antennas with arbitrarily connected passive 
linear elements is described. The method can be easily implemented in any existing code based on the solution, 
using a marching-on-in-time procedure, of the time-domain electric field integral equation for thin wires. Two 
examples of antennas designed using reactive loads are analyzed and the results compared with those obtained 
using the NEC code. 


1. Introduction 

It is well known that the broadband characteristics of thin-wire antennas can be governed by 
using continuous or discrete passive linear loadings, i.e. capacitors, inductances and/or resistances 
[l] -[3] . The transient analysis of the behavior of these loaded antennas might involve either 
frequency-domain analysis coupled with Fourier inversion or the application of direct time-domain 
techniques. The latter offers computational advantages when the antennas are used for short pulse 
radiation, and moreover, usually facilitates the understanding of the electromagnetic phenomena 
involved [4] ,[5] . 

Among time-domain techniques, the Finite Difference Time Domain Method (FDTD), which 
is based on the direct solution of the time domain Maxwell’s curl equations, is widely used due to 
its capability of dealing with complex geometries with arbitrary electrical properties [6] . However, 
the application of the FDTD to curved or arbitrarily oriented straight thin wires, even when they 
are not loaded, presents certain difficulties [7] ,[8] . On the other hand the time-domain electric field 
integral equation (TD-EFIE) solution using the method of moments in the time domain (MoM- 
TD) is well-suited for the study of thin wires embedded in a homogeneous environment [9] ,[10] and 
can also be used, via a hybrid technique [11] that combines the FDTD and MoM-TD methods, for 
analyzing complex geometries comprising thin wires in front of arbitrary inhomogeneous dielectric 
structures. 

In the following we describe a simple method that allows us, with a straightforward and 
computationally efficient modification, to extend any code based on the solution of the TD-EFIE 
for perfect electric conducting thin wires, to analyze the transient response of wires with capacitive, 
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inductive and resistive loading, connected either in series or in parallel. 


2. Time domain analysis of loaded antennas 


The TD-EFIE for the case of a perfect electric conducting thin wire, derived by enforcing 
the boundary condition on the tangential electric field over the surface of the wire, is given by [4] 
s -E\s,t) = (1) 


1 

4-7r£o 


/, 

M*) 


s • s' d r/ . .. § • R d s * R , ., 

^ cB? ?(s ’ * 


ds' 


where: s and s' are tangent vectors to the wire axis of contour F(s / ) at positions s(r) = s 
and s(r') = s'; I(s',t r ) and q{s',t f ) are the unknown current and linear charge distribution at 
source point s' at retarded time tf = t — R/c with R — |r — r'|; E*(s,t) is the field applied to 
the observation point. The charge q{s , ,t f ) can be expressed in terms of I(s\t) by means of the 
equation of continuity. 


The basis for solving (1) using the matching-point form of the MoM-TD, is the following: 
first, the wire length is approximated by N$ straight segments of length As and time is divided into 
N t temporal intervals of duration A t. Then, the current 7(s',t') is expanded into an appropriate 
set of spatial and temporal basis functions. The resulting equation is enforced at the space-time 
intervals by taking its inner product with a set of spatial and temporal weighting functions [4] , 
[9] . In general, this transforms (1) into a matrix equation, for each time step, of the form 

E) + E)-Z Ij\ j = l,...,N T ( 2 ) 

where Ij is a vector whose Ns elements are the unknown coefficients of the expansion of I(s\t') 
in terms of the chosen spatial basis functions at time tj. The Ns elements of the column vector 

E) are the tangential electric fields applied to the observation points on the wire, and E* are the 
tangential electric fields scattered at the observation points and time tj, by currents at previous 
times. Matrix Z is a matrix of interaction whose elements are time independent depending only 
on the geometry of the structure. Prom (2), Ij can be calculated, at each time step tj, in terms 
of previously computed currents, by marching on in time. 


As has been pointed out in [4] , the effect of passive loads connected in series along the wire 
can be easily included in (2) by just adding to its left hand side the unit length voltage drop, 
Eioad > caused by the passive loads in opposition to the applied field. 

Eioad(s,t) = I(s,t)Ri{s) + Li{s)^I(s,t)+g^ J I(s,r)dr (3) 

where Ri{s), Li(s) and Ci(s) are the values of the resistance, inductance and capacitance per unit 
length respectively, located at the observation point s. Expanding /(s',i') in (3) as described 
above allows the evaluation of the derivative of the current and its integral in terms of the 
coefficients of the current expansion. A subsequent application of the point matching form of 
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the MoM leads to a matrix equation of the form (2) where the effect of the resistive loading is 
included in matrix Z [12] and the reactive loading in Z and in Ej. Nevertheless, the extension 
of (3) to the case where the passive loads are connected in parallel is much more complicated 
because additional unknowns, corresponding to the current through each branch, are necessary. 


To overcome this, we describe a simple technique that allows us, without introducing further 
unknowns, to analyze passive loads connected either in series or in parallel. The technique consists 
of replacing, at each time step, the connection of loads in a specific segment of length As, by 
an equivalent configuration formed by a resistor in series with a source voltage. To this aim, 
we start with the expressions of the voltage drop V? +l across a segment of length As loaded 
with a capacitor, C, and the current If +1 through a segment of the same length loaded with an 
inductance, L. At the time step j + 1 they are expressed by: 


Vf +1 = Vf + ±£ + 'l C (T)dT 

1 f tj+l 

Ij + i = Ij + T / U(r)dr (4) 

Jtj 

With the use of a trapezoidal rule to evaluate the integrals, the voltage through the capacitor is 
approximated by 

V j+i — + § c 1 ?) (5 ) 

and through the inductance by 


yL _ — J L 
j+1 ~ A t* +1 


21, 


- f yh + —if) 
K * A t j) 


( 6 ) 


where At = tj+i — tj is the time step that is assumed to be constant. 


Expression (5) illustrates that, at time step j +1, a capacitor can be treated as a resistance, 
(A£/2C) , in series with a voltage source, Vf + (At/2C)lf. Analogously, expression (6) shows 
that an inductor can be treated as a resistance, (2L/At), in series with a voltage source, 
-(V 3 l + (2L/At)Ij J ). This equivalence reduces any connection between passive elements, either 
in series or in parallel, to an equivalent resistance in series with an equivalent voltage source. In 
consequence the EFIE can be rewritten as 


•£r(s,t) 


- —S 

4tt£o Jr(i') 


s-s 'd , 


s ■ R d , .. s- R , r 


ds’ 


loads 1 

+ Et + (7) 

71—1 71 

where the sum on the left hand side runs on loaded segments and where R e n and V e ’ n are, 
respectively, the equivalent resistance and source voltage at the loaded segment s n . By using 
the MoM-TD technique, equation (7) is transformed into the matrix equation 

E‘ + E) = {Z + Z L )ij ; j = 1. Nt (8) 
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where Z is a diagonal matrix that includes the effect of the equivalent loads R e along the 
antenna and does not depend on time. The contribution of the equivalent sources V e (t) along 
the antenna is included in Ej. To calculate the equivalent sources at any segment and time 
step, it is only necessary to store the value of the current and voltage at that segment at the 
previous time step. Therefore, the method neither increases computational time nor requires the 
allocation of significant extra memory. 


As an example, in the case where the passive RLC loads are connected in series, R e and 


VJ +1 become 


At 2 L 
= R+ 2C + At 

= yC - V L + (— - —)/. 

J 1 { 2C At * 3 


and for a parallel connection of the same elements 

ne __ 

4 +^ 


ye — jC _ jL , /2C _ _ MQl 

v j+i i j L j + \ At 2 L* Vj \ 4C + 1M! + 2At 

L R 

Any other combination of loads on a segment can be studied as the series combination of 
the two latter examples. 


3. Numerical results 

This section presents two examples of thin-wire loaded monopole antennas perpendicularly 
attached to a perfectly conducting ground plane with the feed point located at the junction with 
the ground plane. In both examples the antenna is excited by the gaussian pulse 

V*(t) = e-^*- 4 ") 2 (il) 

where g = 5-10 9 $ -1 and t m = 2.146/<? s and the transient response is calculated including 
the described technique in the computer program DOTIGl [9] -[10] . This program solves the 
TD-EFIE using the point-matching form of the moment method with the match points situated 
at the centres of the space-time intervals. The spatial and temporal variation of the current is 
represented by a two-dimensional Lagrangian interpolation of order two in each dimension (space 
and time) as the basis function. 

The first example is a monopole antenna of length 16.3 cm and radius 1.5 mm. The monopole 
is loaded with five reactive elements as shown in the inset of Figure 1. The loads are one inductance 
L = 1.257 10 ~ 8 H, located at the feed point, and four capacitances Ci = 2.122 10“ 12 i ? , C 2 = 1.273 
10" 12 F. and Cz — C A — 6.366 10~ 12 F, which are located at 32.1 mm, 62.3 mm, 90.5 mm and 
140.9 mm respectively from the feed point. Figure 1 shows the time evolution of the current at the 
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feed point. The real and imaginary parts of the input impedance of the antenna versus frequency 
calculated via Fourier transform using the method described and the Numerical Electromagnetic 
Code (NEC) [13] are shown in Figure 2. It can be seen that the two results are practically 
identical. 



Figure 1. Time evolution of the current at the feed point. Inset, geometry of the series loaded monopole antenna. 
(Example 1) 

As a second example a dual-band monopole antenna for cellular phones has been designed 
with operating frequency bands 824-894 MHz and 1.85-1.99 GHz. The length of the monopole 
is 10 cm and its radius 1.5 mm. The antenna is loaded with a resonant LC circuit located 4 
cm from the feed point as shown in the inset of Figure 3. The values of the inductance and 
capacitance are L=1.365 l(T 7 ff and C=5.14 10 ~ U F respectively. Figure 3 shows the evolution of 
the current at the feed point versus time, and Figures 4a and 4b the conductance and susceptance 
versus frequency, calculated using NEC and the proposed method. Again, there is an excellent 
agreement between the two sets of results. 


4. Conclusions 

This paper presents a simple and efficient method to extend any code based on the solution 
of the time domain electric field integral equation for perfectly conducting thin wires to include 
arbitrarily connected passive linear elements. The method does not require a significant increase 
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Frequency (GHz) 


Figure 2. Real and imaginary parts of the input impedance of the antenna versus frequency. (Example 1) 



Figure 3. Time evolution of the current at the feed point. Inset, geometry of the dual-band monopole antenna. 
(Example 2) 
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Figure 4a. Conductance versus frequency. (Example 2) 



Frequency (GHz) 


Figure 4b. Susceptance versus frequency. (Example 2) 
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in computational resources. For the sake of example two monopole antennas loaded in series and 
in parallel are analyzed. The results are found to coincide very well with those obtained using the 
NEC code. 
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Abstract - A Multi-Resolution Time-Domain (MRTD) technique based on Haar- 
wavelets has been applied to model the propagation of waves in an absorbing isotropic 
plasma. To validate this technique, we have computed the transmission coefficient of a 
plane wave through the plasma. 


I INTRODUCTION 

Multi-resolution time-domain (MRTD) techniques have recently been used to solve 
electromagnetic problems, using both Battle-Lemarie [1] and Haar [2] wavelets. 

In this paper, we have applied a multi-resolution technique based on Haar wavelets to 
model the propagation of waves in an absorbing isotropic plasma. To this end, we have 
discretized the convolution equation governing its behavior and solved it with a recursive 
technique [3]. We have compared the results with those obtained using a similar TLM 
formulation [4], and with theoretical results. 


H FORMULATION 

II. a. 2-D Basis Functions and Algorithm. 

The Haar wavelet formulation has the advantage over other wavelet formulations that the 
resulting equations of the algorithm are similar to the traditional FDTD equations, with the 
difference that in the multi-resolution method the equations are computed independently for 
each basis function. We can expand each field component in a two-dimensional combination 
of the Haar scaling (([)) and wavelet (\j/) functions [5]: 
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I J N 


E,fct) = IIIh n (t)- 

i-0 j=0 n=0 


n^k ‘ 4>i(x>() k (z) 

+n E U- ( t>i(xVk(z) 
+„ y E^-V i (x>t) k ( Z ) 
+J E ,Vk ¥i(xVk(z) 


(i) 


The notation is that given in [6] for two dimensions of space. We can expand the remaining 
field components (H x and H z , in a TE mode) in orthonormal basis functions. We can substitute 
each component in Maxwell’s equations, obtaining, as mentioned above, the same equations 
as the traditional FDTD (Yee's scheme), formulated independently for each basis function. 


ILb. Discretization of the Causality Relationship in Linear Dispersive Media 

In time domain and for a linear dispersive medium, the relationship between the 
displacement (D) and the electrical field (E) vectors can be expressed as follows [7]: 


D(t) = e^oEW + EoXO) * E (t) (2) 

where £ 0 is the permittivity of the vacuum, £„ is the permittivity of the medium at very high 
frequencies, %(t) is the time-domain electric susceptibility of the same medium [8], and * 
represents a convolution product. We can discretize this equation in our FDTD/MRTD mesh 
to obtain [3]: 


_ f T- k __ E+, _,E 

k D = e 0 |e„- k E+2x» — ~ 2 - > 

for each value of the spatial indices, k is the temporal index, and % m 

Electric Susceptibility (GES)” and is defined, according to Luebbers et al [9], as: 

(n+l)At 

X„= JxW 


Thus, if we discretize Maxwell's curl equation: 





(5) 


for a medium with a conductivity a, and we substitute Eq. (3) in the result, we get the 
equation describing the FDTD algorithm: 
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( 6 ) 


We must perform this algorithm for each combination of scale and wavelet functions as 
shown in Eq. (1). 


II.c. Plasma Modeling. 


Under certain conditions [8], the frequency domain susceptibility of an isotropic plasma is 
given by: 


xM= 


co: 


co(jco c - co) 


(7) 


where 00c is known as "collision frequency", and C0p as "plasma frequency". As it is 
explained in [9], an expression for the time-domain susceptibility can be found that is valid 
for every frequency, with the exception of co=0, where the relationship is not causal: 


X(t) = f—l(l-e-“ ct ) 

{®c ) 


t>0 


( 8 ) 


if we apply Eq. 4 to this expression, we obtain a GES: 


or 

y — —At- 

co. 


^co * 

p 

(0 

V c J 


(l-e-“= At )e- 
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(9) 



We observe a constant term in the equation, but, as the GES is included in the MRTD 
algorithm by means of a substraction (Xm+rXm-i), this term vanishes, and the summation can 
be computed in a recursive way: 

GCro+l - Xm-1) = 6c m - Xm-2 ) (10) 


m RESULTS 

We have simulated the propagation of a plane wave through a plasma whose 
characteristic frequencies are CGfc=co^=1000 GHz. The computational domain is 10 x 5000 
square cells, with Ax=0.42 |im. The sidewalls are magnetic (in order to support the 
propagation of a plane wave), and the end walls are placed far enough to avoid the reflection 
appearing in our simulation. The excitation is distributed between space steps 2080Ax and 
2090Ax of the computational domain. The transmission coefficient between space positions 
(2110Ax -2120Ax) and (2210Ax -2220Ax) has been calculated, and the results are shown in 
Figure 1. 



Figure 1 . Transmission coefficient through 0.42 mm of plasma: 

comparison between data obtained with MRTD and TLM. 

IV CONCLUSION. 

A MRTD technique for the simulation of the propagation of waves in absorbing isotropic 
plasmas has been developed. Haar wavelets have been employed. 

Results have been compared with those obtained with our own TLM modified algorithm 
[4] and with exact theoretical treatment. We observe a good agreement among them (the 
maximum error being around 1%), a better precision of the MRTD data being obtained, 
especially at low frequencies. The MRTD technique is particularly advantageous if wavelets 
are adaptively employed in situations where the nonuniformity of the fields varies 
dynamically, such as in the propagation of narrow pulses through a plasma. 
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Abstract. Numerical results obtained by the application of a stable marching-on-in-time solution 
scheme of the Electric Field Integral Equation (EFIE) are compared with experimental data relevant 
to the case of scattering from continuous metallic shelters with apertures. The experiments have 
been conducted in an Electromagnetic Pulse (EMP) hardware simulator, reproducing the plane 
wave illumination condition. The limits of applicability of the EFIE to the case of the transmission 
through apertures in perfectly electric conducting (PEC) bodies are discussed. 

1. INTRODUCTION 

Numerical techniques directly operating in time domain (TD), have recently received 
considerable attention in computational electromagnetics. Indeed, the availability of time-domain 
computer codes is of remarkable importance in a variety of applications in the field of 
electromagnetic compatibility, where the interest is often extended to wide frequency bands. 

Among the algorithms proposed in the literature for electromagnetic scattering analyses in TD, 
we mention the marching-on-in-time formulation of the Method of Moments (MoM). A 
shortcoming of this algorithm is its intrinsic numerical instability: the calculated induced current and 
field distributions typically exhibit spurious fluctuations of growing amplitude and eventually 
diverge. These instabilities are introduced at the system discretization stage, i.e., in the conversion 
of the Electric Field Integral Equation (EFIE) to a discrete space-time model. Similar problems 
arise in the application of the magnetic and combined field integral equations. In particular, the 
achievement of a stable numerical procedure strongly depends on how the original problem is 
discretized, both in space (size and shape of the spatial basis functions) and time (time step and 
shape of basis functions with respect to time) [1]. 

The limits of applicability of a numerically stable MoM solution of the Electric Field Integral 
Equation in Time Domain (EFIE-TD) are investigated in this paper with reference to the case of 
continuous structures realized by joining perfectly electric conducting (PEC) plates, in the presence 
of apertures in their walls. The case of PEC closed bodies has been already discussed in [2]. Here, 
simulation data are compared with results obtained from experiments performed on a set of 
canonical closed bodies with apertures. The experiments have been carried out in an open-site 
Electromagnetic Pulse (EMP) hardware simulator, reproducing plane wave illumination conditions. 
The induced current densities on the exterior surface of the metallic bodies have been measured by 
broad-band sensors, which are coupled to the magnetic fields tangential to the above surfaces. 
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Moreover, the electric field penetrating the interior region of a metallic shelter through an aperture 
has been measured by employing suitable field sensors. The measured current densities and field 
amplitudes have been therefore processed in order to eliminate the effects introduced by the 
measuring instruments chain and those of air humidity that strongly affects the results. It is worth 
pointing out that, due to the large time constants characterizing the measurement setup, 
comparisons between experimental and numerical data can be performed only if the employed 
method is stable also at late times; in our case this long term stability has been obtained by the 
analysis developed in [3]. 


2. A Stable MoM-TD formulation 

A TD version of the EFIE suitable for solving the scattering problem of interest is described in 
this section and solved through a MoM approach. Let S be the exterior surface of a perfectly 
conducting body, which can be either closed or open, illuminated by an arbitrarily polarized plane 

wave; E l denotes the incident electric field and J the surface current density induced on S. 

Since the body is a perfect electric conductor, the tangential component of the total electric field 
on S must vanish. This leads to an integro-differential vector equation in the unknown induced 
current density J . The forcing term of this equation is given by the tangential component of the 
incident electric field T4ng : 




4tt • 


dS 


-V 


— 14 \V J(r\z)dxdS 

4 ne* s R } 


— J7 1 
~ ^tang • 


(i) 


In (1), R = |r -r'l represents the distance between an arbitrarily located observation point r and a 
source point r f on S; t-R/c is the retarded time and , 0 , £ are the permeability and permittivity of 
the medium surrounding the scatterer, respectively. In order to obtain a numerical solution of 
equation (1), a suitable expansion of the unknown current is employed. In particular, the 

spatial variation of surface currents induced on the body can be accurately approximated by 
discre tizin g its surface into N r triangular patches over which a set of basis functions can be 
defined. As far as the temporal variation is concerned, a time discretization scheme can be assumed, 
provided that the time step AT is properly chosen. Hence, the surface current distribution can be 
numerically approximated as 


J{r,t)= X hn,jTj(t)f„(r), (2) 

j=-°° n ~0 

where N e is the number of edges of the triangles which model the scatterer, excluding the boundary 
edges if S is open. Each unknown coefficient I n j - represents the value of the component of the 
surface current normal to the «-th edge at the instant jAT■ 

The basis functions f n (r ) that have been utilized are known as "roof-top" functions [4]; they 
are defined over each triangle pair that discretizes the body surface (triangular patch modeling). 
Basis functions Tj (t) with a triangular shape have been commonly used to discretize the temporal 
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evolution [1]. However, in order to guarantee the algorithm stability according to the criteria 
defined in the next section, different interpolating functions have been employed here. Specifically, 
pulses ensuring quadratic approximation over temporal sub-domains have been used. In particular, 
the second order polynomial used to describe time evolution over each sub-domain is an arc of the 
parabola passing through the samples at: t-(j - 1) AT, t = jAT , and t — (j + \)AT . The result is 
a continuous piece-wise quadratic function, with a piece-wise linear derivative, which is 
discontinuous at the integer multiples of AT 9 except when the function to be interpolated is exactly 
linear or quadratic. 

From (1) and (2), adopting a testing procedure as that used in the frequency domain approach 
[4], the following set of equations is obtained at the instant tj = jAT : 

1 dA(r*,tj) r T 

f(PS*+pr) — [<p(r n c+ ,tj)-$(fr,tj)h 

2 d t (3) 

=^(pS + +pr)-E} ang (r n *,tj) , « = l,2,.JVe. 

In the derivation of (3), testing functions coincide with basis functions (Galerkin's method). 
Moreover, the vector potential A is evaluated at the center r* of each edge, while the scalar 
potential (p is calculated at the centroid r n ~ of each triangle . Finally, p n ~ denotes the same 
centroid with respect to the free vertices of the pertinent triangles, while l n is the length of the edge. 

For obtaining a numerical implementation of (3), we substitute explicit free-space expressions in 
TD for the vector and the scalar potential and approximate the surface current distribution as in (2). 
In particular, the use of the quadratic interpolating functions enables us to evaluate in a closed form 
the derivatives with respect to time in (3). Finally, a linear system of equations is obtained which 
relates the unknown coefficients I n j to the incident electric field. It can be expressed in matrix 

form as follows: 


I_{j+ 1)= lif 1 F -I(k)+M~ l .B-lLgO'), (4) 

k=0 J 

where l(j) is a column vector with N e elements representing the unknown coefficients at the 
instant tj, E 1 ^ {j) is the vector containing the values of the incident electric field at each edge at 
the same instant. The matrices M, F and B assume values depending upon both geometrical 

and temporal parameters; however, they do not explicitly depend on time. The first term at the 
right-hand side of (4) reveals the recursive nature of the system, since the coefficients at a fixed 
instant of time depend on the values assumed by the same coefficients at previous instants. 

A previous analysis presented in [3] revealed the existence of a lower and an upper limit, AT^n 
and A7’ max , of the time step AT for which system stability is ensured. In particular, the above limits 
were found to depend on both size and shape of the triangular elements used for space 
discretization. Both AT min and AT max vary with the size of the patches as well with their quality 
factor [5] according to a simple proportionality law. Furthermore, AT max was also seen to be 
affected by the relative positions of the patches. In conclusion, the stability interval strongly 
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depends on the geometrical discretization of the body. In particular, the more uniform and regular is 
the patch modeling of the scatterer, the wider is likely to be the set of values of AT leading to a 
stable behavior of the algorithm. Depending on the scatterer and on the particular mesh employed, 
this interval may sometimes be void so that stability cannot be ensured. This frequently occurs when 
the body has wedges or tips. However, even if a stability interval does not exist, it is convenient to 
choose AT close to the values and AT max , in order to delay instability phenomena as much 

as possible. Further details can be found in [3]. 

3. Numerical and Experimental Results 

The limits of applicability of the EFIE simulation scheme illustrated above in the case of shelters 
with apertures in their metallic walls are discussed in this section by referring to a set of 
comparisons between numerical and experimental data. In this case, among the various parameters 
that might be of interest, we quote the field penetrating the inside the shelters. However, it is well 
known that accurate results for this class of problems can be accomplished only when defining 
equivalent magnetic currents at the aperture [6]. This has been confirmed in our experiments 
performed on a cube with an aperture in one of its walls. In particular, we observed that quite a 
good agreement exists between simulations and measurements for what concerns the external 
induced currents as well as the near field external to the body. Nevertheless, the transmitted field 
inside the objects is not properly evaluated by the EFIE in time domain when the aperture is small. 



&0 


Figure 1 . Normalized echo area and transmitted electric field at the center of the box as a 
function of the aperture dimensions. Continuous line: after [6]; dashed line: MoM-TD 
simulation. 


This aspect has been summarized in Fig. 1, where the normalized echo area o/S of a metallic cube 
with an aperture in one of its faces has been calculated and compared with results obtained by a 
MoM formulation in frequency domain which accounts for the equivalent magnetic currents on the 
aperture [6]. The operating frequency is chosen so that a/X=0.16. As can be seen from the same 
figure, an overall agreement is present for all the values of the ratio a/A for what concerns the 
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normalized echo area. Conversely, the EFIE formulation gives very similar results to the MoM 
formulation with magnetic currents for the transmitted electric field only when the ratio A/a tends 
to one. 

Measurements 

The theoretical behavior described above has been confirmed by some experiments conducted in 
an Electromagnetic Pulse (EMP) hardware simulator, 95 m long, 28 m wide, and with a height of 
19 m (Fig. 2). 



The basis of the simulator is a metallic plane realized by a buried metallic grid while all the 
structures supporting the aerial lines are made of wood and kevlar to perturb the electromagnetic 
field distribution as less as possible. The effective test volume is a box with a basis of lOmxlOm and 
a height of 6 m. A specific acquisition technique has been employed in the measurement procedure, 
which allows us to reconstruct the actual amplitude of the electric field impinging on the scattering 
object. This procedure relies on the definition and the evaluation of the system transfer function. In 
this way, the statistical approach is avoided and no averaging of the measurements is needed to 
compare the predicted and measured results. A detailed description of the instrument chain as well 
as of the acquisition procedure can be found in [2] and will not be reported here for the sake of 
brevity. Some apertures of different dimensions have been opened in a conducting cube 1 m on a 
side. In the following, we show two significant cases. In the first case, the top face of the 
conducting cube has been removed. In the other case, an aperture of dimensions 32 cm x 50 cm has 
been opened in one of the faces of the cube. In Fig. 2, we show the computed and measured electric 
field transmitted into the inside of an open cube whose upper face has been removed. The probe 
was located at a height of 40 cm from the ground and at the same distance from the remaining four 
walls. As can be seen from the same figure, the agreement between measured and computed results 
is very good for this case. Similar agreement has been obtained for the induced current densities and 
the near fields (field probe located close to the body but external to it). The presence of the ground 
plane in the EMP simulator has been accounted for in the simulations by doubling the dimensions of 
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the objects with respect to the ground plane, according to the image theorem, since they are 
actually in electric connection with the ground plane. The open cube has been meshed with 168 
elements and the time step was set to 1.53 • 10~ 9 sec. 



Figure 3. Electric field transmitted into the interior region of an open metallic box. Continuous 
line: measurements; dashed line: MoM-TD simulation. 

In Fig. 3 we show a comparison between theory and experiment for the case of an aperture with 
smaller dimensions, i.e ., 32 cm x 50 cm with the smaller side lying on the edge of the cube. In 
particular, the induced current density at point A, located at the top face (open face) at a distance of 
about 20 cm from the left comer of the aperture, is plotted vs. time in the figure. The body has been 

discretized with 199 elements and the time step was set to 1.4984 10"" 9 sec. Similar accordance 
between simulated and measured data has been obtained for the near field in the exterior region of 
the scattering body. Conversely, the EFIE in TD failed to adequately evaluate the transmitted field 
inside the body, confirming that a more general formulation involving the magnetic currents has to 
be used if one is interested in this parameter. 

4. CONCLUSIONS 

In this paper we addressed the problem of describing the transient scattering from metallic ob¬ 
jects. Measurements have been conducted in an EMP hardware simulator and compared with the 
results obtained by applying a stable formulation of the MoM in time domain, in order to validate 
the latter numerical technique. A measurement procedure has been determined which allows us to 
reconstruct the actual amplitude of the electric field impinging on the scattering object avoiding a 
statistical approach. The agreement between simulated and measured data confirms the accuracy of 
both the simulation scheme and the measuring procedure. The limits of applicability of the proposed 
EFIE formulation have been discussed for the case of conducting bodies with apertures. We under¬ 
line that a crucial point that allowed us to perform this comparative analysis was to obtain late time 
stability of the numerical procedure. Indeed, this is fundamental for simulating real experiments. 
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time (sec) 

Figure 4. Surface current density induced at point A on the exterior surface of a metallic cube 
with an aperture on one of its upper. Continuous line: measurements; dashed line: MoM-TD 
simulation 
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Abstract 

A novel technique for analyzing transient electromagnetic scattering from three-dimensional 
inhomogeneous dielectric targets is proposed. The electromagnetic volume equivalence principle is 
invoked to construct an integral equation for the electric flux density throughout the scatterer. This 
equation is solved using a marching-on-in-time scheme in which the electric flux density is expanded 
in space by volumetric rooftop basis functions defined on a tetrahedral mesh and in time by piecewise 
polynomial basis functions. The proposed method is validated for a representative dielectric structure 
by comparison of scattering data obtained with this method and various frequency domain techniques. 

1. Introduction 

This paper presents a volume time domain integral equation technique (VTDIE) for studying 
transient scattering from three-dimensional inhomogeneous dielectric objects. The analysis of 
electromagnetic phenomena in the presence of inhomogeneities is important for many applications. 
For instance, scattering by and detection of airborne particulates, medical diagnostics, interactions 
between antennas and biological bodies are all applications involving inhomogeneities. Furthermore, 
transient analysis capabilities are invaluable for studying wide-band or nonlinear systems. 

If the scatterer is piecewise inhomogeneous, a surface integral equation method can be used [1] 
for analysis. For highly inhomogeneous bodies, a volume integral equation method as presented by 
Schaubert et ah [2],[3] for the analysis of frequency domain phenomena is more suitable. Here, 
Schaubert’s work is extended to the time domain by constructing a VTDIE and solving it using a 
marching-on-in-time (MOT) scheme. 

The paper is organized as follows: the construction of the VTDIE and formulation of the MOT 
method are presented in Section 2, results that validate the method are shown in Section 3, and 
conclusions to this study are presented in Section 4. 


647 



2* Formulation 


Consider an inhomogeneous scatterer, occupying a volume V that resides in free space, defined 
as an unbounded medium with permittivity £$ and permeability ^ * It is assumed that the scatterer is 
isotropic, lossless, and nonmagnetic with frequency independent permittivity £(r) . The scatterer is 
illuminated by an incident electric field E 1 (r,r) (Figure 1(a)), which is zero on V for time t < 0. Our 
task is to determine the total electric field E(r,f) for all space and t > 0 . 

An integral equation for the electric flux density in V is derived to solve this problem. To 
facilitate this derivation, the total electric field is decomposed into incident and scattered components 
as 


E(r, t) = E z (r, t) + E s (r, t) (1) 

By invoking the volume equivalence principle, the scattered field is characterized by an equivalent 
volume current density J(r,r) that radiates in free space and is given by 

J(r,r) = ^(r)a ; D(r,r) (2) 

where Jr(r) = (e(r)-£Q)/e(r) is the contrast ratio and D(r,r) = £(r)E(r,r) is the electric flux density. 
The scattered electric field E^r,?) is expressed in terms of J(r,r) as 

E 5 (r,f) = -a r A(r,r)-V^(r,0 (3) 


where the magnetic vector potential A(r,r) and the electric scalar potential 0(r,r) are given by 


A.(r,t)=—d, f dv —— and 
x Air J? 


47t£(\t R J 


In Equations (4) and (5) R = |r-r'| denotes distance from the source to the field point and T =t~R/c 
denotes retarded time where c is the ffee-space speed of light. Combining Equations (l)-(5) results in 


, 2 1 dvK(r') 


Pfr'.T) 

R 


,V'-K(r')D(r\f) 


= E'(r,f), Vre V, 


which is the desired integro-differential equation for D(r,r). Upon solving this equation for D(r,r), 
E J (r,r) everywhere can be computed using Equations (2)-(5). 

Equation (6) is solved using an MOT scheme. To this end, the electric flux density D(r,r) is 
discretized using a set of N s spatial basis functions {f„(r)} and N t temporal basis functions {7y (r)} 
as 
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( 7 ) 


N s A',-1 

D(r,0 = I £ D n j t n (T)Tj(t), 

n =1 j-0 

where {Z> n j} is the set of unknown expansion coefficients. The spatial basis functions used are the 
volumetric rooftop basis functions proposed by Schaubert et al [2]. In order to define these functions, 
the scatterer V is approximated by a set of N e tetrahedral elements {V k }; the contrast ratio over 
element k is assumed constant and denoted K k . A spatial basis is associated with each triangular face 
S n of the tetrahedral mesh. With reference to Figure 1(b), the spatial basis functions are defined as 
f n (r) = a n p~ /(3v*) for re V* and zero elsewhere; V~ are the pair of tetrahedrons that share face n , 
a n is the area of face n, and v* is the volume of V *. The vector is defined from the free vertex 
of V£ to the position vector r; is defined similarly except it is directed towards the free vertex of 
V~ . The convention used is superscripts refer to tetrahedrons and subscripts refer to triangles. An 
important property of these basis functions is that their normal components are continuous across each 
interior face. Therefore the boundary condition requiring continuity of the normal components of 
D(r,r) across material interfaces is satisfied within V . Only one tetrahedron is associated with those 
faces residing on the boundary of V ; hence the corresponding spatial basis function is nonzero only 
over a single tetrahedron. Therefore, the number of spatial unknowns N s equals the number of 
triangles in the tetrahedral mesh. 

The temporal basis functions are chosen to be of the form Tj(t) = T(t- jAt) , where At is the 
time step size. In this study, cubic polynomial interpolation bases [4] are used. The order of 
interpolation is chosen high enough to accommodate the derivatives that need to be evaluated in the 
vector potential contribution of the integral equation. The time step is chosen as At = % /(5t» max ) 
where rn max is the maximum frequency of the incident field. 


The integral equation is now discretized using a Galerkin testing procedure. Upon substitution 
of the electric flux density given by Equation (7) into Equation (6), the resulting equation is tested at 
t = tj= jAt with the spatial basis functions f m (r), m = 1,..., N s , using the inner product 
<f (r),g(r)) = Jy tfvf(r)*g(r) . To solve this system by MOT, all terms involving electric flux density 
values at time steps prior to tj are moved to the right-hand side of the equation. In matrix form this 
yields 

Z 0 D ; =V ; -£:(:*Z*D ; _*, (8) 

where the m th components of the vectors D; and Vj are given by D m i and (f m (r),E z (r,q)^, 
respectively, and the elements of the sparse interaction matrix Z,- are given by 
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where the self Zf^, vector potential zf^ mn , and scalar potential zf^ mn contributions to Z i mn are 
given by 

z uL = (f m (r),71 i (Of n (r)/£(r))| /=o (10) 

Z tmn dvdfr t (T)K(r')±jp^ 

Z imn = (r) > V J dV ' T ~‘ (T)V '- *0^ 

Equation (8) relates the expansion coefficients at the j th time step to the incident field and the 
coefficients at prior time steps. Hence, all the expansion coefficients D n j can be evaluated by 
starting at the first time step, forming the right-hand side, and solving Equation (8) at each time step. 
Equation (8) is solved iteratively using the transpose free quasi-minimal residual algorithm with initial 
guess equal to the solution of the previous time step. Stability of the MOT scheme is achieved without 
any special averaging methods and can be attributed to the implicit nature of the method, the presence 
of the strong self-term in the integral equation, and careful evaluation of the integrals. 

Detailed formulas for the computation of Z z can be found in [2J. The volume and surface 
integrals that arise are computed using quadrature rules for tetrahedra and triangles [5],[6]. In this 
study, five point quadrature rules for tetrahedra and seven point quadrature rules for triangles are used 
for both evaluating the fields and testing purposes. When the source and field points are nearby in 
terms of the shortest excitation wavelength, the singularities are extracted by expanding integrals as: 

J duT.iiz)^- = J du'(T_i (f)— T-i (t ~t c ))^Y- + / duT-i(t-t c )^ (13) 

where g(r) is either 1 or p* and the integral is over either a tetrahedron or triangle. The expansion is 
offset by t c >R/c to ensure causality in the numerical implementation. The first term on the right- 
hand side is well-behaved and is computed numerically, while the second term is evaluated 
analytically using results from [7]. 

3. Results 

This section presents results generated by the MOT scheme outlined in Section 2. The incident 
field is chosen to be an x polarized modulated Gaussian plane wave propagating in the z direction 

E*(r,f) = xcos[(cr-r-z)6) 0 /c]c ^ ct rz 6a) /2a (14) 


( 11 ) 

/=0 
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where 0)q is the center frequency and o is the standard deviation. The effective maximum frequency 
of the signal is defined as CO r ^ =G)q+6c/<j . At this frequency the signal power is down by over 150 
dB from its peak value. 

Results are given in terms of the frequency domain radar cross section RCS (r, CO). The 
RCS (r, CO) is obtained from the electric flux density generated with the MOT scheme by first 
computing the far-field signature: 


E /ar (r, /) = -—(I - ff 

12ft 


N s 

ix 

n=l 


Kt r ( r-r'V, kZ r __ ( r-r'V , 

—J v+ p n D n t + — dv+—j v -p n D n t + — dv 




(15) 


This temporal signature is then transformed to the frequency domain yielding E f ar (r,co) from which 
the RCS (r, CO) is obtained by 


RCS(r,(0) = lOlogjo 


4ft 


|E /ar (f,fi)) 




(16) 


where I E l (©) I is the magnitude of the Fourier transform of the incident field given by Equation (14). 


To validate this method, scattering from a dieletric spherical shell is studied. The inner shell 
radius is 0.75 m and the outer is 1 m; the permittivity is 2eq . The parameters chosen for the incident 
pulse are <7 = 2ftcj(OQ =2.89 m. The shell is discretized into 927 tetrahedrons yielding 2120 
unknowns. At the effective maximum frequency there are approximately 10 spatial unknowns per 
wavelength in the dielectric. The method, referred to as the volume marching-on-in time method 
(VMOT) in the figures, is run for 400 time steps where the time step size is 0.493 ns. 

The VMOT results are compared with those computed by a Mie series solution and a frequency 
domain volume methods of moments code (VMOM) [2]. First, the backscattered RCS is compared 
over the entire band of the incident pulse (Figure 2(a)). Next, the RCS is examined at a single 
frequency (Figure 2(b)). Good agreement is achieved. 


4. Conclusions 


An MOT scheme based on a volume integral equation has been described for transient 
scattering from inhomogeneous dielectric objects. It is based on rooftop basis functions defined over 
tetrahedral volume elements, which ensure field continuity. This technique has been validated through 
numerical examples. While the formulation presented here applies to dieletric bodies, this method can 
be easily extended to encompass scatterers with non-trivial permeabilities by including an equivalent 
magnetic current density and then solving a pair of equations for the electric and magnetic flux 
densities. The MOT scheme presented here is prohibitively expensive (it scales as Q(N t Nj )); work is 


651 


being done to accelerate it using the plane-wave time-domain algorithm [8], which would reduce the 
complexity to 0(N t N s ). 
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(a) (b) 

Figure 2. Scattering from a spherical shell, (a) Backscattered RCS vs. r// l, where r IX is the outer 
radius of the shell in wavelengths, (b) RCS vs. elevation angle for r/X = 0.4. 
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Abstract - The use of wavelet basis functions for the efficient solution of Partial Element Equivalent Circuit 
(PEEC) models is studied. It is demonstrated that the use of wavelets for expansion and testing functions produces a 
sparse system matrix. Here, this effect are examined by varying the wavelets order, the thresholding level and 
decomposition levels. 

I. Introduction 

The capability of the Partial Element Equivalent Circuit (PEEC) method [1,2] to model any electromagnetic problem - 
formulated in an integral equation form- by means of an equivalent circuit renders this technique very appealing. In the 
recent past, several advancements of this numerical technique has been developed: from the introduction of incident field 
and time delays [3] to hybrid models taking into account conductive surfaces and complex cables [4,5], up to the 
introduction of non-orthogonal elementary PEEC cells to improve the accuracy for complex geometries and to decrease the 
computational efforts [6]. Because the issue of stability for this algorithm in a marching-on-in-time solution has been 
already analyzed and stable implementations suggested [7] the only block preventing its widespread use appears to be its 
computational complexity. In order to facilitate the fast evaluation of transient electromagnetic fields by means of PEEC a 
wavelet approach has been developed [8]. This work continues that study and performs a parametric analysis of the sparsity 
and threshold properties of the resulting matrices due to the application of the wavelet transform [9] to a PEEC time 
domain solution. The paper is organized as follows: in Section II a brief derivation of the PEEC technique is given together 
with the application of the wavelet transform in order to decrease the number of non-zero entries in the global system 
matrix and hence speed-up the system solution. Section III is devoted to discuss the properties of the wavelet-transformed 
matrix for various types of wavelets and threshold’s levels. In the same section the procedure is applied to the time-domain 
analysis of a multiconductor transmission line (MTL) and the results compared with those obtained by different 
approaches. At the end, Section IV draws some conclusions pointing out the main advantages of the proposed technique 
and some critical point that deserve more investigation. 

II. PEEC BASIS AND WAVELET PEEC (W-PEEC) FORMULATION 


The PEEC method is a numerical technique for the solution of integral equations in form of an equivalent circuit [1,2]. The 
electric field in the free space at a conductor can be written for the homogenous case as 


E*(r,t) = —’~ + f dSLd o(r 1 r’^lv’ + V- 


where t’= t - --- denotes the retardation time and c is the phase velocity of the medium. J is the current density 

c 

flowing in the conductor, q is the charge density and G^F) is the Green’s function. In order to solve (1) for the unknowns 
J and p, the current and charge densities are spatially expanded in pulse basis functions. This results in a spatial 


discretization of the conductive regions into volume and surface cells. The circuit interpretation of the electromagnetic 
problem is obtained by weighting the resulting equation in the Galerkin approximation sense. 

The development of the overall PEEC circuit is obtained by combining the resulting equation for the voltage drop along 
each appropriate direction in a conductor cell with the discrete version of the conservation of charge equation in form of 
KCL at each node in the equivalent circuit. 


654 





The elementary PEEC model is constituted by an inductive cell and two capacitive half cells as shown in Fig. 1. Having in 
mind this model three sets of equations can be written enforcing the Kirchoff voltage law (KVL) to both the longitudinal 
and vertical branches and the KCL to each capacitive node. Let the entire equivalent circuit contain N capacitive nodes and 
M inductor cells. A first set of equations is given by using KVL which takes the matrix form (capital bold letters refer to 
matrices): 

~ AV-Lp ir =Vs (2) 

where A is the node-branch matrix, V are the capacitive nodes’ potentials, in which R and Lp are the matrix of the 
conductors resistances (it is diagonal) and partial inductances respectively, I L holds the inductor currents and V s are the 
longitudinal voltage sources. 

By applying the KCL at the node k (see Fig. lb) there are four contributions: the currents on the branches n and n+1, the 
current on the capacitive branch towards infinity and the current I Iumped on the generic lumped element between nodes j and 
k. The KCL becomes 

dV dV r t 

lumped lumped ^ + ^ lumped + A I l ~ (3a) 


In (3a) and (3b) matrix operators C lumped ^j, L Jumped ^-j are NxN matrices assembled according to the node¬ 

voltage analysis of a lumped-element circuit [15]. Is is the excitation vector that describes the amount of current injected 

into the nodes by independent current sources, F is a NxN diagonal matrix operator in which f^. = -i- and pu< is the self 

Pkk 

potential coefficient of node k, V c contains the voltage drops on the capacitances Cj. A third set of equations is given by the 
KVL applied to the vertical branches of the circuit in Fig. 1: 

-V + HV c =0 (4b) 

where H is suitable NxN matrix whose entries are h^ = . The diagonal elements of this matrix are set to unity. These 

Pkk 

relations can be cast in the following set of differential equations 

-1 N lumped +C| mnpri Aj F(Aj _ A T 

0 -1 N H 0 y o 

0 -A -R -L P — V c ~ Vo 


It can be easily shown that the product of matrices F and H * 1 gives FIT 1 = P ' 1 . Wavelets are a class of recursively defined 
orthonormal basis functions. The complete orthonormal set is formed by taking dilations and translations of a single 
function y(t) 

Vjilt (t) = 2* V (2 j t-k) (6) 

The function y(t) has a companion, the scaling function <p(t). Each of them are completely determined by a set of 
quadrature mirror filter coefficients, h(n) and g(n) as in [10] 

N c 

<p (t) = V2^h(n)p(2t-n) (7) 


V (t) = V2irg(n)?>(2t-n) 
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Fig. 1 - Elementary PEEC cell: a) discretization of the physical conductor, 
b) equivalent PEEC circuit. 






In order to obtain the wavelet formulation of the PEEC method from equation (5) the wavelet representation of the time- 

derivative operator — is needed. In representing such operator in a wavelet basis, the so-called non-standard form [11] 
dt 

(NS-form) is chosen. The NS-form algorithm leads to a process of complexity 0(N) while its standard form counterpart 
requires N 2 operations. Generally a truncated matrix representing the derivative operator in the NS-form has more zero 
entries than its standard form and, since the number of operations is proportional to the number of non zero entries, it is 
desirable to perform computation directly using a matrix represented in the NS-form. The property of vanishing moments, 
typical of orthonormal basis functions, causes the wavelet coefficients corresponding to entries far from the diagonal to be 
smaller than the diagonal ones allowing the thresholding process. The NS-form of the wavelet matrix operator T 

(representation of —) is given by a chain of triplets 
dt 

T = {A j Bj (9) 

and admits a recursive definition 

T.=!>' noi 


flni Of Aj. Bj, Tj and r^, of Tj are computed as in [8] and, therefore, the representation 


The matrix elements cti, 


of — is completely determined. The application of the Wavelet Transform to the system of equations (5) gives 
dt ~ ' ' 

”^ N,w (g lumped, w ^lumped,w'T.) F w T “A w ^Liurapcd,w f^S.w 1 

0 -1m... H... 0 0 


-1n,w 

(g lumped, w + Gi umpc< i (W T) 

F W T 

-A w 

0 

~ ^ N,w 

H w 

0 

0 

" A w 

-R w 

"Lp,, 

^'lumped.w'F 

"A w 

0 

0 


where T is the sparse representation of the differential operator and I s>w and are the wavelet transforms of the exciting 
sources. Combining the previous equations the final system in the unknown nodal potential is obtained 
l A vv T (^upmed,-vyT)' 1 +G lupmed;w +C ]impedw T + A W T (R W + Lp w T')' 1 A w +P W = 

Is.w +Lp w T)v Sw 

which can be rewritten as 

Y W V W = I S (13) 

where I s w takes into account the lumped and the distributed sources expanded in the chosen wavelet basis. Having the 
wavelet basis a local support the matrix T has a banded structure with few non vanishing codiagonals. Due to the high 
sparsity of the T matrix, the matrices L lumpcd> w , A w , C }uinpedi w and R w have the same sparse structure. Great advantages in 

computing time can be achieved by using a sparse treatment of equation (13). When the wavelet coefficients of the nodal 
potential have been evaluated, all the other expansion coefficients can be obtained from the Kirchhoff equations in the 
transformed domain. Finally the physical time domain solutions are derived by means of the reconstruction algorithm 

-H» +C0 Nj 

Vi(t)= X v £wn,,k( t )+ £ 2>ii(k) Vjfk (t) ( 14 a) 

k=-« k=-» j=j 0 

■HO +03 N d 

Im(«)= Z I lV(k>j.,k© + 2 £l dm 0.k) Vj , k (t) (14b) 

k=^0 k=-«j=j e 

in which V*[ (k), I (k) and vjj (k), I j m (k) are the approximation and detail coefficients of the nodal potential V,(t) 
of the cell i and current I m (t) of the cell m respectively and N d is the finer decomposition level. 


III. Results and discussion 
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The parametric analysis is carried on a structure that consists in a 10 cm long two conductors transmission line with 
frequency independent parameters (see Fig. 2). The conductors width and thickness are w = 360 urn and t = 36 pm 
respectively. The line is excited by a voltage source V s set to a gaussian pulse: 

Vs = V 0 exp((t-t <1 ) 2 /(2A 2 )) (15) 

where V 0 = 1 V, center t<j = 10 ns, and semi-width A = 2.12 ns and the source internal resistance is R s = 50 Q. The loads 
are resistive terminations indicated as R^ = 10 Cl, R >NE! - 75 Q and Rj^ = 10 Q. The short-circuit condition on Rfei is 
also considered. 



v 



Fig. 2 - MTL and load terminations: 

Rs “ 50 Q, R>te 2 = *75 Q, Rpr■ = 10 C2 and R ^£2 “10 O. 


As orthonormal basis are used the Daubechies wavelets (db) of seven different orders, from 3 to 9. The decomposition 
level chosen to obtain the transformed system (13) is Nd = 2. The property of vanishing moments, typical for these basis 
functions, causes the wavelet coefficients corresponding to entries far from the main diagonal of Y w to be smaller than 
those close to the diagonal. To take advantage of this, elements of Y w that are smaller than an element threshold r are 
discarded and the remaining elements of Y w are stored in a sparse matrix form. This allows matrix-vector multiplications to 
be carried out in a time proportional to the number of non zero elements in the sparse matrix, speeding the solution of (13). 
As in [12] the chosen threshold is 



06) 


M 

where 8 is a constant, M is the dimension of Y w and |Y w | w = max J ^|F w (/,y)|. The values of x changes as the 
decomposition level and wavelet order change. For the application in Fig. 2 Table I reports the threshold values. 

Table I - Threshold values as function of wavelet order and decomposition level. 


Wavelet Order 

N d = 2 

N d = 3 

3 

51.287 

24.544 

4 

34.567 

19.567 

5 

28.963 

13.619 

6 

24.038 

11.306 

7 

20.181 

9.504 

8 

17.659 

i 8.256" 

9 

15.483 

7.345 
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Defining the sparsity of a matrix as the percentage of non-zero entries in the matrix, it is significant to show how the 
wavelet order affects the sparsity of Y w . This results in Fig. 3 where as the wavelet order increases the sparsity decrease up 
to the 42.78%. 



Fig. 3 - Percent sparsity of matrix Y w 
for increasing wavelet order. 


The sparsity is also dependent on the decomposition level. Table II gives, for different wavelet order and decomposition 
levels, the percentage of non-zero entries in Y w . 

Table n - Percent matrix sparsity as function of wavelet order and decomposition level. 


Wavelet Order 

N d = 2 

N rf = 3 

3 

51.2429 

36.9130 

4 

50.1022 

35.7640 

5 

48.3381 

34.4762 

6 


33.6725 

7 

45.3298 

32.4155 

8 

44.1456 

31.4535 

9 

42.7800 

30.3467 


It is interesting to note how with the increasing of the decomposition level the threshold decreases. One would expect that 
the sparsity of the matrix would have an opposite trend. In the case of wavelet decomposition this is not true (as shown by 
the two last columns of Table II) because the entries of matrix Y w are strongly dependent on the decomposition level. 
Although the thresholded matrix retains only the 30% of the elements of Y w the information contents it is not lost and good 
solution accuracy can be obtained. As measure of this accuracy we use the relative error between the solution computed 
using the full matrix Y w in (13) and the solution using the thresholded sparse matrix. Fig. 4 shows the transient behavior of 
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Figures 4 and 5 reveal the capability of the wavelet basis to accurately represent very fast transients. Although the percent 
errors are very small, it should be noted that in the case of short circuit (Rfei=0) - when the reflections along the 
transmission line are more preponderant than in the matched case - the percent error increases. 


IV. Conclusions 

A wavelet formulation for the PEEC method in time domain has been presented pointing out some numerical features of 
the wavelet decomposition; the described use of classical wavelet basis reduces the global solution time of a constant 
factor. The thresholding of the wavelet-transformed matrix has shown the capabilities of these basis functions in retaining 
the essential content of information also with an high degree of sparsity. 
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Abstract: We describe the development of a 
hybrid integral equation time domain / physical 
optics time domain approach to the analysis of a 
horn - dish pulsed antenna, intended to transmit 
veiy short duration, well-focused pulses. The 
hybridisation developed is shown to be effective, 
with good internal consistency achieved, and to 
provide the ability to perform design studies and 
optimisations. 

1. INTRODUCTION 

In this paper we will outline the modelling of a 
horn - dish antenna system intended to transmit 
short pulses of EM energy. 

The requirement is to produce a short high energy 
pulse in the far field, which is spatially as 
narrow as possible, and which temporally has 
suffered as little smearing as possible in its 
focussing and transmission. In particular, the 
initial rise time of the exciting signal should be 
preserved. 

The feed to the transmitter is a short (~lOOps) 
signal from a high voltage (~10kV) pulse 
generator, which is directed along a stripline, 
through a horn, and onto a dish. From here it is 
reflected away to produce the required far field. 

Our intention was to develop methods to model 
this system, to provide a tool for analysis and 
design optimisation, and to demonstrate the 
methods to at least proof of principle stage. 

2. SYSTEM SPECIFICATION 

The working configuration adopted for model 
development purposes was as shown in Figure 1. It 
comprised a pair of parallel plates forming a 
simple transmission line, diverging and widening 
to form a horn, facing towards a large dish. The 
transmission line region was filled with dielectric 
of relative permittivity 2, with the high voltage 


pulsed signal introduced via a co-axial feed at its 
upstream end. 

Some dimensions and other data are given in 
Table 1. 


Pulse: Gaussian, fwhm (ps) 

140 

Dish diameter (m) 

2.0 

Dish focal length (m) 

0.866 

Horn mouth to the plane of the rim 
of the dish (m) 

0.58 


Table 1: Dimensions and other data. 


3. MODELLING 
Issues 

The overall size of the device is of order 3m, and 
the pulse width of interest is ~40mm, making the 
problem electrically large. The problem is also a 
truly time domain one. 

There are of course many powerful tools for 
antenna analysis in the frequency domain. These 
are mostly based on the method of moments 1 ' 3 , 
with an increasing number of frequency domain 
finite element treatments appearing 4 , typically 
with domain closure achieved by hybridisation 
with an integral equation treatment 5 . These are 
restricted to structures of relatively modest 
electrical size. Larger components would generally 
be treated by an optical technique, hybridised to 
the method of moment treatment as appropriate 6 . 
Given the truly time dependent nature of the 
problem, they are not an attractive option here. 

An FDTD approach 7 would have involved a 
rather large analysis, and with the dish being 
curved, and the horn plates not lying in co¬ 
ordinate planes, would have been vulnerable to 
staircasing effects, requiring fine discretisation. 
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The approach we adopted was a combination of 
integral equation time domain (IETD) analysis 8 ' 
10 for the horn, hybridised to a physical optics 
time domain (POTD) 11 treatment of the dish. 

Formulation 

The hybrid treatment adopted may be 
summarised: 

(i) Compute the surface field history on the horn 
in isolation using the usual integral equation 
(here, the MFIE) in the time domain. 

(ii) Compute the fields induced on the dish by the 
surface horn fields, using a time domain physical 
optics approximation on the dish, and using as the 
excitation for this an integral over the (retarded) 
surface fields on the horn. 

(iii) Compute the time dependent near and far 
fields around the structure by integration of the 
retarded surface fields on the dish and horn, from 
whatever point it is desired to evaluate the field. 

Feed region and hom 

We begin the integral equation analysis at the 
end of the transmission line, where the plates 
begin to diverge and widen. (Modelling of the 
details of the feed and line would be better done 
by FDTD, and this was the intent of another part 
of this work, involving others, which did not 
proceed.) We must then represent adequately the 
entry of the wave from the transmission line into 
this diverging region of the horn. We model the 
field propagating down the line into the throat as 
a plane wave localised in the throat area. We 
make the approximation that the field being 
emitted from the transmission line is determined 
only by events and geometry upstream; the field 
entering the integral equation region is a 'given’. 
Integrations over this 'source plane’ in the throat 
region serve as the excitation to the integral 
equation model, taking on the role of the incident 
wave term. It is to be expected that the far field 
eventually produced is independent of the details 
of the modelling of this source plane, and indeed 
computations in which source plane areas 
differing by a factor of 16 were used will be seen 
shortly to yield essentially identical results. 

The MFIE is then evaluated for the hom in 
isolation; effects of the dish are neglected. The 
justification is two-fold, (i) The fields scattered 


back to the horn from the dish should be an order 
smaller than those directly induced on the hom by 
the source, (ii) Causality means that any 
influence on the fields on the dish which this does 
have will be some two horn-dish transit times 
after the main peak arrives at the dish. The 
early fields, which are of prime interest, are thus 
wholly unaffected by the approximation. 

The dish 

The dish is large, and the local radius of 
curvature of its pec surface is much greater than 
the wavelengths of interest, allowing fields 
induced on it to be estimated using the physical 
optics approximation. The excitation for this, the 
‘incident wave’, is obtained by integration over 
the surface of the horn, with of course the horn 
surface fields being evaluated at the appropriate 
retarded times. 

The far field 

Far (and near) fields are found from integration 
over both the dish and horn. Note that for the far 
field the T /R’ term dominates over the l/R 2 , 
leaving a far-field temporal dependence (mostly) 
determined by the time derivative of the pulse 
fed to the system. 

4. RESULTS 

Dish location and sizing 

The dish diameter, focal length and horn - dish 
separation are the main degrees of freedom 
available in ensuring that as much as possible of 
the energy leaving the horn impinges on the dish 
(subject to the observation that decreasing the 
separation too much makes the horn ever less of a 
‘point source’, and harms focussing). We used an 
assessment of the spatial distribution of 
transmitted energy to guide the dish design. The 
time integral of the square of the modulus of the H 
field provides an indication of the energy passing 
a point. Values were computed on 2m radius circles 
in the E and H planes centred on the mouth of the 
hom. This is shown in Figure 2. 

From this a dish diameter of 2.0 m was 
tentatively adopted, arranged to subtend a full 
angle of 120 degrees, with a view to capturing as 
much energy as practical within this constraint. 
The mouth of the horn was located 0.58m from the 
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plane of the rim of the dish. For the mouth of the 
horn then to be at the focus required that the 
directrix of the parabola be located 1.16m from 
the origin of co-ordinates. 

Far field produced 

The time dependent far (E) field, on boresight, is 
shown in Figure 3. 

The relative angular distribution of the energy 
transmitted from the. dish, evaluated at a 
distance of 100m from the dish in each of the E 
and H planes, is shown in Figure 4. The actual 
quantity plotted is the time integral of | H | 2 . 

Validation and verification 

Comparison with empirical formula 

There is an empirical formula for evaluation of 
far-field intensities from a horn: 



where L is characteristic of the horn mouth size, d 

dV 

the distance away, c the speed of light, and — 

dt 

some average value during the pulse rise. 

Taking d= 100m, L - 0.25m, and the derivative as 
(10 4 )/(140xl0 12 ) gives an E of 

460 Vm' 1 at 10m 
or ~50 Vm' 1 at 100 m. 
or ~5 Vm' 1 at 1000 m. 

Our computation gave a peak field, after focusing 
by the dish, of 210 Vm' 1 at 100 m. Being a modest 
multiple of the approximate empirically 
predicted field due to the horn alone, this is 
certainly plausible. Actual results will be 
discussed further below. 

Source plane modelling 

In Figure 5 and Figure 6 are shown the radiation 
patterns of the horn alone with a series of source 
planes differing by a factor of 16 in area. As is 
seen, essentially identical results are obtained in 
all cases. 


Time integral of the far field 

The far scattered field is dominated by quantities 
proportional to the time derivative of the field; 
terms proportional to the field itself decline with 
one higher power of the distance away (1 /R 2 , 
rather than 1/R). Since the excitation is Gaussian, 
whilst the far field will not be a pure 
differentiated Gaussian, we expect the field there 
to be both positive and negative, and would hope 
that the time integral of the far field was small. 
(An alternative interpretation of this statement is 
that ‘DC’ components will not propagate to the 
far field.) 

The time integral of the boresight far field was 
computed. To provide a measure against which to 
assess ‘small’, the time integral of the modulus 
was also evaluated. The ratio of these integrals 
was 100:1, providing a very satisfying assurance of 
the internal consistency of all the various 
calculation stages. 

Frequency spectrum of the far field 

The frequency spectrum of the input pulse, a 
Gaussian, is itself a half-Gaussian, with a 
significant DC / low frequency component. For the 
reasons discussed above, we would expect this zero 
frequency part not to be present in the far field. 
Figure 7 shows a frequency spectrum of the pulse 
eventually reaching the far field. The zero 
frequency component has been eliminated, 
providing further evidence of the internal 
consistency of the calculations. 

5. DISCUSSION AND CONCLUSIONS 

The combination of integral equation time domain 
and physical optics time domain methods 
assembled seem well able to model the pulsed 
antenna. Internal quantitative consistency is good, 
and behaviour predicted is qualitatively as 
would be expected. 
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Figure 2 Time integrals of | H | 1 on circles of 2m 
radius centred on the mouth of the horn in the E 
and H planes 



IW(nt) 


Figure 3 The E field on boresight versus time 
evaluated at distances of 1000m from the dish. . 


Figure 1: Schematic of the feed - horn - dish 
combination: elevation and plan views. 
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Figure 5 Radiation pattern of the horn alone (at a 
frequency of 3 GHz, extracted from the pulsed 
result by Fourier transformation): E plane, with 
the three different source regions. Note that the 
scale on these graphs is linear, not logarithmic 


Frequency (GHz) 

Figure 7 Fourier transforms of E-field on boresight 
@1000m and input pulse versus frequency 
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Fast Solvers for Electromagnetic Simulations-A New Age Analysis Tool 
W.C. Chew 
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Abstract 

This talk will overview the development of solution techniques for Maxwell's equations as an 
analysis tool. It reviews the solution techniques that were prevalent around the beginning of the 
20th century. During that period, closed form solutions were sought for simple shape objects 
such as spheres, cylinders, half-planes, half-spaces, etc. That was the age of simple shapes. As 
the demand of science and engineering called for solutions to more complex problems, scientists 
and engineers developed approximate methods to solve Maxwell's equations approximately. This 
was often the combined use of asymptotic and perturbation methods. Examples of such are the 
asymptotic high-frequency methods in scattering, the small perturbation method in rough surface 
scattering, and the match asymptotic methods in boundary value problems, multiscale analysis in 
random media, and optics. Consequently, a larger class of solutions can be sought by such 
techniques, empowering the solutions available to Maxwell's equations. This was the age of 
approximations. 

The advent of computer technology spurred interests in solving Maxwell's equations 
numerically. Much of these works were to find the corresponding matrix equations, which were 
sufficiently well posed and represented a high fidelity representation of the original Maxwell's 
equations. This lifted the restriction on solution types solvable for Maxwell's equations, and 
greatly expanded the scope and versatility of Maxwell's solvers. That was the age of numerical 
solvers. However, due to the inefficiency in the solution techniques, only small problems can be 
solved. 

The most recent developments of analysis methods for Maxwell's equations are the fast solvers. 
These fast solvers have been developed for electrostatics, electrodynamics, as well as fast solvers 
that are continuously valid from static to electrodynamics; for the frequency and time domains, 
as well as for layered media. These fast solvers use resources that are orders of magnitude 
smaller than traditional numerical solvers. Their importance in electromagnetic simulations is as 
important as fast Fourier transforms in signal and image processing. It is quite certain that these 
solvers will precipitate a revolution in analysis methods in electromagnetics. Problems that 
required 10 years of computer time to solve in the past can now be solved within a day. 
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Abstract 

The efficient solution of linear systems of equations is of considerable interest in computational 
electromagnetics. Dense system as they arise in Moment Method solutions are usually solved by 
a general LU factorisation (Gauss algorithm). Iterative algorithms are widely used when systems 
with sparse matrices must be solved. This paper analyses how iterative algorithms can be applied 
to the dense matrices of Moment Method solutions. Two test cases are studied closer. 

1 Introduction 

Many numerical methods for electromagnetic problems necessitate the solution of large linear 
systems of equations like the following 

AX = B (1) 

with matrix A , excitation vector B and solution vector X. The vectors have n elements for the n 
unknowns, the matrix is then square with size nxn. 

The efficient solution of such linear systems of equations is of considerable interest. A de¬ 
crease of computation time for one frequency step can decrease significantly decrease the overall 
computation time for a broad band computation. Presented here is the potential use of iterative 
algorithms for the solution of systems with dense matrices. One major issue is hereby the ac¬ 
tual computer hardware the code is supposed to run on. Today, the use of hardware dependent 
libraries allows to achieve almost the theoretical maximum performance on a given processor. 
After presenting some characteristics of modem CPUs, we investigate the possible use of iterative 
algorithms by looking at two test cases. 

Mathematically the eigenvalues and the singular values of the coefficient matrix have a sig¬ 
nificant influence on the convergence of iterative solvers, we therefore investigate the position of 
eigenvalues of two Moment Method matrices, and how a preconditioning can influence the position 
of the eigenvalues. 

In this paper we use two moderately large matrices. One is taken from a recent paper [1] and 
produced by a 9 by 9 array of wiregrids. The second one is taken from the example files coming 
with Wiregrid (see html://www,emss.co.za/) and models a car. Pictures of the two models are 
shown in Fig. 1. The number of unknowns is 1944 for the array and 694 for the car. 
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(a) 9 by 9 array of wiregrids. (b) Car model. 

Figure 1: Wiregrid models used for NEC to produce Moment Method matrices. 


2 Solving Linear Systems of Equations 

2.1 Iterative algorithms for sparse matrices 

Local methods like the finite difference or the finite element method involve the solution of systems 
with sparse matrices (i.e. most of the matrix elements are zero). The solution of these matrices is 
made by using iterative methods which produce a converging series X t for the solution vector X 
by multiplications with the matrix A and possibly its transpose A T . The computational cost is 
therefore proportional to the square of the number of non-zero elements for sparse matrices. This 
is then multiplied with the number of iterations. 

Various iterative methods are known [2]. Usually a method is not used directly on the linear 
system itself, but on a modified system by applying a preconditioner 


S 1 AS 2 X = S 1 B (2) 

with the preconditioner matrices Si and 52. The intelligent use of preconditioning matrices 5 can 
significantly increase the convergence rate to solve (1). 

2.2 Direct and iterative algorithms for dense matrices 

In contrast to the local methods global methods as the Moment Method produce a system where 
the coefficient matrix A is dense, that is that all elements of A are non-zero and not neglect-able. 
The method of choice for solving systems with general matrices is the Gauss elimination. It 
comes with a computational cost of O (fra 3 ), hence increases with the third power of the number 
of unknowns. 

Iterative algorithms compute again the solution vector by subsequent multiplying an iterate 
by the system matrix. Therefore the computation cost is the number of iterations times the 
square of the number of unknowns N. If the number of iterations is much less than the number of 
unknowns, the iterative algorithm will produce a result in a shorter time than the direct method. 
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Table 1: Run times for a 9 by 9 array of wiregrid patches 



f = 299.8 MHz 

/ 

- 500.0 MHz 

algorithm 

factor solution 

total 

factor 

solution 

total 


in s in s 

in s 

in s 

in s 

in s 

original LU 

378.19 1.10 

379.29 

376.79 

1.13 

377.92 

LAPACK LU 

72.76 0.44 

73.20 

75.47 

0.45 

75.92 


A big influence on the computation time has the actual hardware the code is running on. 
Beside the ability of the compiler to produce highly optimised code, the programming of the code 
for a particular hardware is one major parameter for high performance code. 

3 Computer Hardware Considerations 

Modern computers are fitted with CPUs which implement many features known from high-end 
computers 10 years ago. Parallel computing, pipelining and integrated multiply-adds are included 
along with multiple stage cache memories in most modern RISC processors known in workstations 
and also in CISC processors as in PC computers. High programming skills are needed to use 
these high performance possibilities. As each processor has its own characteristics, these skills 
are also highly architecture dependent. Therefore the use of machine dependent libraries (BLAS, 
LAPACK, QMRPACK and the like) is mandatory to achieve the best performance. 

The use of blocked algorithms [3,4] for the well-known Gauss LU factorisation uses the memory 
system in an optimised way and allows to achieve almost the theoretical maximum performance 
on a given hardware. Table 1 shows a comparison of computation times for a text-book im¬ 
plementation of a Gauss LU factorisation in NEC 4, and the corresponding high performance 
implementation making use of the processors abilities (run times on IBM SP nodes, see [1]. Other 
run times in this paper are on HP C160 workstations). 

4 Eigenvalues and Iterative Solvers 

The distribution and position in the complex plane of the eigenvalues and the singular values of 
a coefficient matrix have a decisive influence on the convergence of iterative solvers. Different 
algorithms depend on either the eigenvalues or the singular values: The Conjugate Gradient 
Methods for the normal equations (CGN) is solely dependent on the singular values, whereas the 
Conjugate Gradient Squared (CGS) and the Biconjugate Gradient (BCG) are also influenced on 
the position of the eigenvalues in the complex plane [5]. We therefore determine the eigenvalues 
and the singular values for the above matrices. 

In Fig. 2 the eigenvalues and the singular values are shown for an unsorted matrix (basically 
the segments axe numbered in an arbitrary way). The eigenvalues are all scattered around in the 
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Figure 3: Singular Values and Eigenvalues of the Car Matrix. Elements are Sorted by Geometrical 
Proximity. 


complex plane, there are two very large singular values and two widely eigenvalues scattered in 
the complex plane, rendering convergence very slow. 

Now the segments are numbered so that two geometrically close segments are also close in the 
matrix. The eigenvalues and the singular values are shown in Fig. 3. The sorting of the elements 
can be considered as a preconditioning with the permutation matrices. The run-times obtained 
for the two cases is reported along with the results for the unsorted matrix in Tab. 2 and Tab. 3. 
As the matrix for the array case is already well sorted, no sorting was necessary in this case. It 
can be seen that no convergence was achieved for the car, the linear system is not solvable with 
a brute force application of an iterative algorithm. 

Whereas the array of wiregrids is building a strongly diagonally dominant matrix, the unsorted 
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Table 2: Run-times for Test Case Array. - NC = No Convergence. 


Algorithm 

Preconditioner 

Addtl. Storage 

Iterations 

Runtime / s 

Blocked LU 

- 

- 

- 

162.62 

TFQMR 

- 

0 

127 

62.12 

TFQMR 

scaling 

0 

74 

36.56 

TFQMR 

band 

37.4 MB 

10 

14.94 

TFQMR 

block (3 blocks) 

19.22 MB 

6 

97.12 


matrix for the car case is not exhibiting this dominance. The intensity plots in Fig. 4 shows clearly 
a strong main diagonal for the array case, but a very uniform matrix for the car case. The array 
has a very strong dominant main diagonal, hence the use of iterative solvers is possible. Iterative 
algorithms are less advised for the car due to the unfavourable structure of the matrix. 

4.1 Preconditioned iterative algorithms 

In Tab. 2 and Tab. 3 we report the run times for different preconditioners. The iterative algorithm 
used for all computations is the Transpose free quasi minimum residual iterative algorithm [6]. 
During our computations this algorithm converged the fastest. 

The first preconditioner is the so-called Jacobi scaling preconditioner. The main diagonal 
elements are normalized by two diagonal matrices Si and S 2 . The additional memory usage is 
just one vector of size n, as only the nonzero elements of Si and S 2 have to be held. Additionally 
the two vectors have the same elements so that one vector is sufficient. 

The second preconditioner is the block preconditioner. The left precondition matrix S\ is 
formed by a block diagonal matrix. The blocks on the main diagonal are the inverse of the same 
blocks in the system matrix A . The matrix Si A has identity blocks on the main diagonal and 
dense subdiagonal blocks. It is closer to the identity which can be very well seen in the eigenvalue 
plot in Fig. 3. The eigenvalues are much closer to unity, the singular values are much lower. An 
iterative solver will converge in fewer steps. 

The third preconditioner is a band matrix of the inverse of the main diagonal plus several 
sub-diagonals. The result of the preconditioner matrix and the system matrix Si A will again be 
closer to the identity. In the tables can be seen that convergence can be achieved even for the car 
case, however with more memory requirements. For the small matrix size of 694 elements, the 
run times for the preconditioned iterative algorithms axe higher than the corresponding Gauss 
factorisation. For bigger matrices this can revert, so that iterative solvers with an intelligent 
preconditioner may solve big systems faster than a direct algorithm. 
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(a) 9 by 9 array (b) Car 


Figure 4: Intensity Plots of the Two Test Matrices. 


Table 3: Run-times for Test Case Car. - NC = No Convergence. 


Algorithm 

Preconditioner 

Addtl. Storage 

Iterations 

Runtime / s 

Blocked LU 

_ 

- 

« 

2.5 

TFQMR 

- 

0 

NC 


TFQMR 

sorting 

- 

NC 


TFQMR 

scaling 

- 

NC 


TFQMR 

band 

2.2 MB 

NC 


TFQMR 

block (3 blocks) 

2.4 MB 

NC 


TFQMR 

sorted 4- band 

2.2 MB 

269 

24.25 

TFQMR 

sorted + block (3 blocks) 

2.2 MB 

248 

18.0 
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5 Conclusions 

The position of the eigenvalues and the singular values of two test Moment Method matrices 
were investigated. Iterative solvers seem to work well for Moment Method matrices if these are 
diagonally dominant. This can sometimes be forced by properly ordering the elements related to 
their geometrical position. Furthermore the use of sophisticated preconditioner is advisable for 
the application of iterative algorithms to dense matrices. 

One of the best ways to improve the overall performance of linear algebra codes is to use the 
architecture specific libraries to obtain nearly the theoretical maximum performance. 
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Abstract 

We investigate the numerical convergence prop¬ 
erties of the combined field integral equation 
(CFIE). The result of the moment method based 
on the CFIE is known to approach the correct so¬ 
lution as the grid density increases, but we show 
that the rate of convergence can strongly depend 
on the choice of the CFIE parameter a. The 
dependence is shown to be greatest when the 
scattering is dominated by non-specular mech¬ 
anisms. We present numerical results for a con¬ 
ducting sphere, trihedral comer reflector, ogive, 
and thin rod. 

1 Introduction 

Surface integral equations (SIEs) form the basis 
of most so-called “numerically exact” computa¬ 
tional electromagnetics scattering computer soft¬ 
ware (codes). They possess the desirable prop¬ 
erties of surface (rather than volume) discretiza¬ 
tion and exact coupling between physically sepa¬ 
rated components of the scatterer. As computer 
capabilities increase, codes based on SIEs may 
be applied to larger and larger problems, and 
the results are often not evaluated against rig¬ 
orous convergence analyses [1]. In many cases, 


“rules of thumb” are used blindly. Such “rules of 
thumb” are derived based on experience gained 
by researchers and practitioners over (hopefully) 
years of use. However, they are prone to be re¬ 
lied upon in situations far different than those 
in which they were derived. Often, “rules of 
thumb” are accepted unquestioningly as gospel, 
and can, in some instances, lead to grave dif¬ 
ficulties. The Air Force is actively pursuing 
highly accurate computational results for various 
canonical target shapes for use as radar cross sec¬ 
tion (RCS) measurement calibration standards, 
which drives the requirement for accurate SIE- 
based codes. 


2 CFIE Formulation 

The combined field integral equation (CFIE) is 
very popular in the analysis of closed conducting 
scatterers. It is based on the electric and mag¬ 
netic field integral equations (EFIE and MFIE, 
respectively) and was first developed to solve the 
problem of real-valued eigenvalues of the EFIE 
and MFIE. The EFIE can be written 
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0.5fH**n.2Opp»..F««y1 MHt, Tx - 0 cfcg.. ft.-Ottog. 


n x E mc = ~rzn x 
jkY 

p.v. J + */vv}^' (i) 

while the MFIE is written 
ft x = i / - ft x P.V. f J x VV ds' (2) 

2 i 5 

where ^ nc and FF nC are the incident fields, S 
is the surface of the conducting scatterer, h is 
the outward unit normal, P.V. / is a principal 
value integral, k — 27r/A is the wavenumber of 
the surrounding medium, Y is the intrinsic ad¬ 
mittance of the surrounding medium, J ~ fix H 
on 5, and ip — exp(—jk\r — r r |)/(47r|r — f*|) is 
the Green function for the (suppressed) exp (jut) 
time convention. 

Mitzner is often credited as originating the 
CFIE [2]; Poggio and Miller [3] present it as 

{(1 - a)(l/2 - £)+ahxM}J = 

(1 - a) ft x ff mc + a Yn x (i? inc x ft) (3) 

where C is the integral operator in (1), M is the 
integral operator in (2), and a is an arbitrary real 
constant between 0 and 1. a effectively weights 
the contribution of the EFIE and MFIE: a — 0 
degenerates to the MFIE, a = 1 degenerates to 
the EFIE, and a = 0.5 weights the EFIE and 
MFIE equally. 

Along with possessing no real-valued eigen¬ 
values, the CFIE is also generally better con¬ 
ditioned than either the EFIE or MFIE. It is 
this latter property that is most attractive to 
computer software writers, as it tends to reduce 
sensitivity to numerical errors in computing the 
impedance matrix terms, or, in the case of itera¬ 
tive solution, reduces the number of iterations 
necessary to reach a given residual threshold. 
In the spirit that the EFIE and MFIE should 
be weighted equally, many practitioners have 
adopted a “rule of thumb” that a value of a — 0.5 
is appropriate to use in nearly all cases. How¬ 
ever, as we shall show in the following sections, 



Figure 1: Monostatic RCS of a 6-inch diameter 
conducting sphere, comparison of CFIE to Mie 
series. The spikes are seen in the pure MFIE and 
EFIE results only. Curves are well-converged at 
a discretization density of 20 points/A. 

the accuracy of the results can vary alarmingly 
with a, especially when non-specular scattering 
mechanisms dominate the far-field scattering. 

3 Numerical Results 

Here we present numerical result for four con¬ 
ducting geometries, analyzed using mature, well- 
known moment method codes. The first target is 
the sphere, useful because it has an exact series 
analytical reference solution. The second tar¬ 
get is a corner reflector whose RCS is dominated 
by specular scattering mechanisms. The third 
target is an ogive, with scattering properties ex¬ 
hibiting a large dynamic range as a function of 
angle and the presence of dominant non-specular 
scattering mechanisms. The fourth target is a 
thin rod, which also has strong “second order” 
physics dominating its RCS. 

3.1 Sphere 

The sphere is the one target for which results 
can be obtained to very high accuracy without 
using a method of moments computational ap¬ 
proach, since the Mie series solution is avail¬ 
able [4]. Comparison of a portion of the scat- 
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tering spectrum using MFIE, EFIE, and CFIE 
(for various values of a) as well as a Mie series 
solution is shown in Figure 1. This illustrates the 
need for the CFIE: values of a — 0 (MFIE) and 
a = 1 (EFEE) result in spurious resonance-type 
phenomena which are clearly anomalies. The 
CFIE was originally introduced to control these 
problems [2]. The figure shows that the anoma¬ 
lies occur for both EFIE and MFIE, but do not 
exist in the CFIE results for 0.25 < a < 0.75. 
As can be seen from the curves, the choice of 
the numerical value of a for this case is fairly 
irrelevant: the agreement with the Mie series is 
reasonably close. Often for problems in general, 
a value of a = 0.5 is chosen somewhat arbitrarily, 
and the discretization is set at computationally 
affordable value, usually at about 20 points per 
wavelength for the highest frequency needed. As 
will be seen in the following paragraphs, this may 
not be an appropriate strategy in general. 



3.2 Trihedral Corner Reflector 

The trihedral corner reflector is a target that has 
recently come under scrutiny for use as a high- 
RCS calibration standard for scattering measure¬ 
ments. Each of the three reflecting surfaces is a 
3-inch by 3-inch square, the thickness of each 
plate is 0.375 inches, and the exterior edges are 
beveled at 45°. The trihedral geometry is shown 
in Figure 2. The maximal monostatic RCS is 
presented when the projection onto the incident 
field plane is largest. 

Like the sphere, the scattering from the tri¬ 
hedral is dominated by a specular mechanism. 
However, the CFIE calculations show a signifi¬ 
cant variation as a function of grid density and a, 
as shown in Figure 3. For some applications, the 
variation in RCS is within tolerance, but for cali¬ 
bration purposes the variation translates directly 
into overall RCS measurement uncertainty. In¬ 
deed, measurement uncertainty is bounded be¬ 
low by uncertainty in the reference RCS of the 
calibration standard. 


Figure 2: Trihedral Geometry. The trihedral 
is built out of 0.375-inch thick plates, exterior 
edges beveled at 45°. The three interior (re¬ 
flecting) surfaces are square, 3 inches by 3 inches 
each. 


3’ Trihedral Bcwight RCS, 3.2 GHtK Pol 



Figure 3: Monostatic RCS of trihedral. Compu¬ 
tations were made with a general-purpose MoM 
code using an iterative matrix solver employing 
the fast multipole method. 
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3.3 Conducting Ogive 


10 ft Ogive, F= 150 MHz, Monostatic, 200 pi/K, legend: Pol, ilpfaj 



Figure 4: Monostatic RCS of the 10-foot ogive 
at 150 MHz. Results are plotted for both po¬ 
larizations and a equal to 0.25, 0.50, and 0.75. 
Discretization density is 200 points/A! Note that 
the lower curves (<jxj> pol) are well converged, but 
there is still some separation in the 66 pol as a 
function of a. 


10fl Ogive, F = 150MHz,Monostatic,TTPol ,a = 025 



Figure 5: Monostatic RCS of the 10-foot ogive 
at 150 MHz, 66 polarization. Results are plotted 
for various discretization densities. Note that at 
40 points/A the results are far from converged, 
even though such a fine discretization is normally 
considered more than sufficient. 


The conducting ogive is an object which is very 
useful for RCS measurement range diagnostics. 
At nose-on incidence, it has low backs cat tering 
levels in both principal polarizations, and its 
wideband signature has trends opposite to that 
of a sphere: it starts high at low frequencies, and 
slowly falls as frequency increases. In the “down- 
range” domain, its signature is seen as princi¬ 
pally dominated by tip responses, so that tar¬ 
get mounting levels can be easily diagnosed and 
controlled, especially for large geometries. The 
ogive can exhibit high dynamic range (50-70 dB 
from nose to side at high frequencies) and, being 
a body of revolution (BOR), is in principle cal¬ 
culable using a “numerically exact” BOR formu¬ 
lation of a CFIE code [5]. The results given here 
are for a large conducting ogive target, 10 feet 
in length and one foot in diameter at the center 
(recall that an ogive is formed by rotating a cir¬ 
cular arc about its chord). At one polarization 
(E-field in the scan plane, i.e., 69 polarization), a 
traveling wave dominates the near nose-on scat¬ 
tering for a significant angular extent near 30° off 
the ogive axis. Although this is very clear at fre¬ 
quencies where the object is electrically large, we 
show lower frequency results only, due to com¬ 
puter run-time considerations. Figure 4 depicts 
the results for both principal polarizations for 
the 10-foot ogive at 150 MHz. Results for a range 
of a values are shown. For the non-traveling 
wave polarization or “PP” on the plots), 
all three curves are identical on this scale. Note 
that this was obtained using 200 points per wave¬ 
length. Figure 5 gives the results for a set at 
0.25, for various values of points per wavelength. 
The inaccuracies seen in the traveling wave lobes 
is very high indeed, and is still almost 10 dB from 
the final converged answer at discretization of 40 
points per wavelength. Note that many engineer¬ 
ing computations are performed at a maximum 
of 20 points per wavelength due to problem size 
and computer time constraints. Similar results 
have been obtained for various values of a; the 
convergence for some a values is faster than oth¬ 
ers. All eventually converge to the same curve 
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sets as in Figure 4, but at discretisation values 
far higher than is practical for many computa¬ 
tional problems. 

3.4 The Long Thin Rod 

Another calibration target of interest to the RCS 
measurement range is the long thin rod. At 0° el¬ 
evation (rod axis horizontal to the ground), there 
is a co-polarization signature (00 and <j}<f>) from 
the rod, but no cross-polarization signatures (0<j> 
and (j>0 ). However, at 45° elevation (rod axis 
forms a 45° angle with respect to the ground), 
both co-polarization and cross-polarization sig¬ 
natures exist. The 0<j) and $0 signature should 
be the same when the 45° elevated rod is broad¬ 
side to the radar. This configuration provides 
a check for the cross-polarization quality of the 
measurement. Even though for this case the pat¬ 
tern cut is not on one of the principal planes of 
a BOR target, BOR codes can still be used to 
generate the theoretical results. This is done by 
calculating the 00 and <jxj) signatures at 0° ele¬ 
vation then performing a coordinate transforma¬ 
tion to correctly predict the signature for the de¬ 
sired pattern cut. The cross-polarization arises 
due to the coordinate transformation. The re¬ 
sults shown in Figures 6 and 7 are for 0° eleva¬ 
tion at 140 MHz. The rod is 12 feet long with a 
1.66 inch diameter. For 00 polarization the EF3E 
results converge at a grid density of 50 points per 
wavelength while the CFIE (with a = 0.5) fails 
to converge even at 120 points per wavelength. 
For polarization the CFIE results converge 
at 90 points per wavelength while the EFIE still 
has not converged at 120 points per wavelength. 
This case demonstrates that the choice of a and 
grid density is highly sensitive to polarization, 
even for the same target! 

4 Conclusions 

In this paper, we have discussed the role of 
the combined field integral equation in compu¬ 
tational electromagnetics, concentrating on its 
convergence properties. We have presented re¬ 
sults showing that we must use CFIE became it 


Thin Rod RCS vs Angle — 140 MHz, 60 Pol 



Figure 6: Monostatic RCS of long thin rod at 
140 MHz, 00 polarization. The rod is 12 feet 
long and 1.66 inches in diameter. EFIE means 
a = 1 and CFIE means a = 0.5; ’ppw’ means 
poinfcs/A. Note the very poor convergence of the 
CFIE results. 


Thin Rod RCS vs Angle — 140 MHz, $<j> Pol 



Figure 7: Monostatic RCS of long thin rod at 
140 MHz, <jxj) polarization. Note the poor con¬ 
vergence of the EFIE for <j></> polarization. 
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controls the “resonance anomalies” that plague 
pure EFIE and MFIE results. These anoma¬ 
lies are unpredictable and will be found in wide¬ 
band results that are needed in modem scat¬ 
tering analyses. We have demonstrated that 
the accuracy of the CFIE results depend very 
strongly on both the value of a weighting the 
EFEE and MFIE contribution, and the fineness 
of discretization (points per wavelength). The 
most distressing aspect we discovered is that the 
value of a that produces the fastest convergence 
is problem-dependent and may or may not be 
consistent with matrix condition number. The 
quickest convergence is usually found for targets 
that are dominated by specular-type scattering 
mechanisms ( e.g sphere, cylinder, comer reflec¬ 
tor), and is not overly sensitive to choice of a 
for most (but not all) applications. The slowest 
convergence is exhibited for long thin scatterers 
which support traveling waves, such as the ogive 
and the rod. In particular, convergence of the 
traveling wave lobe is exquisitely sensitive to the 
choice of a. Convergence properties at lower fre¬ 
quencies may or may not show the same trends 
at higher frequencies. In other words, if a — 
0.75 gives the fastest convergence at 150 MHz 
(where it is easy to run a thorough and wide- 
ranging convergence test), it may or may not be 
the fastest-converging a for 10 GHz (where such 
a test would be impractical in terms of compu¬ 
tational resources). It is this lack of a general- 
izable rule that causes the most concern among 
the authors. We hope that further research can 
quantify the reasons for this in the near future. 
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Abstract This paper discusses the modeling of various kinds of surface junctions in an 
SIE/MoM (Surface Integral Equation / Method of Moments) formulation applied to complex 
objects consisting of arbitrarily shaped conducting and dielectric bodies. Methods of describ¬ 
ing various types of junctions and systematically incorporating them in numerical solutions are 
presented. The procedures are of interest for the specific application of arbitrarily shaped di¬ 
electric resonator antennas and their associated feed structures and packaging. An E-PMCHW 
formulation in conjunction with a moment method procedure using generalized triangular basis 
functions is presented to deal with such general junctions. 

I. Introduction 

The modeling of general surface junctions in an SIE/MoM formulation is considered in this 
work. The specific application leading to this study is that of a dielectric resonator (DR) antenna. 
Since an experimental study of a cylindrical dielectric resonator (DR) antenna was reported in 
1983 [1], this antenna has drawn continued interest because of its small size, efficiency, and 
potential ability to perform multiple antenna tasks via simple mode coupling mechanisms. The 
configuration of a DR antenna may range from a very simple one which allows analytic solutions 
to a very complex one. A typical structure for a DR antenna is a DR element of high dielectric 
constant excited by a single feed such as a microstripline or coaxial cable. Various shapes and 
combinations of DR elements as well as various feed structures have been suggested, however, 
which may improve the antenna performance in the areas of bandwidth, power handling, and 
antenna efficiency. 

Rigorous SIE analysis methods for non-trivial DR antenna configurations have been available 
mainly for body of revolution (BOR) objects [2,3]. This work results from an interest in the 
analysis of DR antennas of more arbitrary configurations, which may include general 3D objects 
comprising an arbitrary combination of conducting and/or dielectric bodies of arbitrary shapes, 
using an SIE/MoM method with triangular patch basis functions. The junction modeling problem 
has been considered in previous works for conducting surfaces [4], for dielectric surfaces [5], simple 
combinations of both for BOR objects [2,3,6,7], and for a general case of conducting, dielectric, 
resistive, and impedance boundary condition surfaces [8]. This work attempts to provide a 
formalism for systematically describing junction models for a wide variety of junction types. 

II. Formulation 

A. Problem Description 

The geometry under consideration is a general inhomogeneous body with Nr dielectric regions, 
each of which may contain conducting bodies as well as impressed sources as shown in Fig. 1. 
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The regions have permittivities e* and permeabilities /x,, where i ~ 1 ,...,Nr. Both e* and fa 
may be complex to represent lossy materials. Non-zero thickness conducting bodies denoted by 
Ro may occupy any parts of the space. Infinitely thin conducting bodies can reside in any region, 
at interfaces between regions, or they may penetrate from one region to another. All conductors 
are considered to be PEC (Perfect Electric Conductor) material. One of the regions, region R\ 
in Fig. 1, may be of infinite extent. The total fields in each region are denoted by E , and H*, 
where i = 0,1,2,, Nr, for electric and magnetic fields, respectively, and i — 0 denotes PEC 
regions with E 0 = Ho = O. The time variation, e***, is assumed and suppressed throughout. 

Any two adjacent regions, Ri and Rj, are separated by a surface denoted by Sij(t s , t, /), where 
t 3 is the type of the surface, and t and / are the ‘to-region’ and the ‘from-region’ of the surface, 
respectively, which define the region connectivity and the surface orientation. The interface 
between a non-zero thickness conducting body and a dielectric region also forms a surface denoted 
in the same way with the ‘from-region’ being region zero. An infinitely thin conducting body in 
a dielectric region forms yet another type of surface with the ‘from-region’ being the same as the 
‘to-region. Thus, as shown in Fig. 1, there are four types of surfaces specified by t s — 

• Type-0 (t 9 = 0, pfO) ; interface between a conducting body and a dielectric region, 

• Type-1 (t a = 1, pfl) ; infinitely thin conducting body within a dielectric region, 

• Type-2 (t s = 2, pf2) ; infinitely thin conducting body between two dielectric regions, and 

• Type-3 (t 5 = 3, df) ; dielectric interface between two dielectric regions. 

When more than two surfaces meet at a curved line segment, they form a junction. Depending 
on the numbers and types of the surfaces at a junction, there are a variety of possible junction 
types, all of which are considered in this study. 

Each region Ri is surrounded by a closed surface Sf and is associated with an inward normal 
unit vector n*. The surface interface between regions Ri and Rj, if one exists, is denoted as Sij, 
for any i and j, i = 1,..., Nr, j = 0,1,..., Nr. Thus, Sf is the set of all interface surfaces 
Sij, where j represents all region numbers that interface with region Ri. Note that Sij = S# for 
j ^ 0; however, the normal unit vectors n* and % are in opposite directions to each other on 5Jy. 


S41 S31 



_ PfO (tg — 0) 

- Pfl (t 5 = l) 

- pf2 (t s = 2) 

-df = 3) 



(Ei, Hi) 


Fig. 1. General geometry under consideration. 


684 







B. The Field Equivalences 

According to the equivalence principle [9], the original problem can be decomposed into N R 
auxiliary problems, one for each dielectric region. To obtain the auxiliary problem for region Ri, 
the impressed sources of the original problem are retained only in region Rj and the boundaries 
of the region are replaced by equivalent surface currents radiating in a homogeneous medium 
with the constitutive parameters of region Ri. Electric currents are used for the conducting 
surfaces, while electric and magnetic currents are used for the dielectric boundaries. The electric 
and magnetic currents appearing on opposite sides of a dielectric interface in different auxiliary 
problems are taken equal in magnitude and opposite in direction to assure the continuity of the 
tangential components on these boundaries as they are continuous in the original problem. In 
this procedure, the fields produced within the region boundaries by the equivalent currents and 
the impressed sources in region Ri must be the same as those in the original problem, while the 
zero field is produced outside these boundaries. The electric and magnetic currents along Sf are 
Jj = TLi x Hj and M £ = E £ x f^, respectively. 

A system of surface integro-differential equations can be obtained by enforcing the boundary 
conditions of continuity of the tangential components of electric field on the conducting surfaces 
and both electric and magnetic fields on the dielectric surfaces. This results in the E-PMCHW 
formulation [6] when there is no junction in the problem. For problems having general junctions, 
however, it is not easy to express the integral equation system explicitly apart from the testing 
procedure. Thus the system of integral equations is presented in the next section after describing 
the junction modeling and the basis functions. 


C. Modeling of Junctions in the Moment Method Solution 


Arbitrarily shaped surfaces are discretized in triangular patches and the equivalent surface 
currents are approximated by expansions in the RWG basis functions on the patches, which are 
expressed as [10] 

N Tj 

J(r)^£j n B£(r;S£,S£) (1) 

n=l 


where, 



0 , 


r 6 Sy n 
otherwise, 


( 2 ) 


Ntj is the number of electric basis functions, and S^ n are the positive/negative domains or the 
from-/to- faces of the basis function, respectively. For magnetic currents, can be 

defined similarly. The testing functions Tn and T„ m are also taken to be the same as (2). With 
the basis and testing functions defined we have a matrix equation 


• Z 7j Tj T IjT m 

r \& > 1 


|V*> > 1 

T T m Tj Y TmT ™ 

> J 


iv 7 - > J 


When there are general surface junctions, the current related to an unknown coefficient may 
exist on many different surfaces. In such cases, the expression (1) is not rigorous enough. For 
example, there is an electric current on a dielectric surface in the region Ri equivalent problem 
and another one flowing in the opposite direction in the region Rj problem, represented by ‘—/ n ’ 
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as shown in Fig. 2(a). The expression in (1) for the electric currents has this sort of implication 
for the basis functions Bn when the domain of the unknown involves a dielectric interface. When 
more than two dielectric surfaces meet at a junction, this scheme does not work. Thus for general 
junctions, we seek another way of expressing the generalized current more rigorously. We will 
use two different basis functions for the same unknown coefficient related to a dielectric surface 
as shown in Fig. 2(b). In other words, the unknown coefficient has a multiplicity of two when it 
represents the electric or magnetic current on the dielectric face. Extending this to the general 
case, the generalized current is defined in terms of the generalized basis functions as 

Nt j n 

C(r) = {J(r),M(r)} = {£/ n B n (r), £ I n B n (r)} (4) 

n=l+N T j 

where, 

B„(r) = Bjp(r), B n „(r) = B^r), T„ = r*, with k = n, if n<N Tj (5) 

B„(r) = B* m (r), B„ t (r) = B£"(r), r„ = r k , with k = n - N T . , if n > N Tj (6) 


N 


B?(r) 




b£,b£ 


Tirtf 
'ft'dfn 
ffn v > tfnv 

Rnv 


+ Nr m 

EBg(r(7) 

t?=l 

EBl’fr ;/&.,*/*,*.) (8) 

V~\ 

the v th basis function of I n , v — 1,..., r n 

RWG basis function defined over the corresponding patches as in (2) 

multiplicity of the unknown coefficient, I n = 

total number of faces (surfaces) connected to the junction for I n 
number of dielectric faces (surfaces) related to I n 
from-face and to-face of . 
region of B^. 


’R'dfni Tldfn — *Hf 

n<ifn + 1> otherwise 


fl 



ft 


fl 



i2,- (/l)/2j 72j) 


(a) Conventional representation (b) New representation 

Fig. 2. Two methods of representing basis functions. 
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Notice that there is one-to-one correspondence between bJ, or Bjj* and the parameter set 
{ffnvJfnviRnv}- The numbers of unknowns and basis functions for a given junction or edge 
are determined from the types and numbers of the faces connected to the junction by considering 
proper boundary conditions at the junction. The methods of determining them and systematical¬ 
ly incorporating them in the MoM solutions have been developed. Examples of modeling general 
junctions are shown in Fig. 3, where J n and M n are used instead of Bn J and B£ m , respective¬ 
ly. The generalized testing functions and {T m }m=i are also defined in a 

similar manner. We also define Cf, the generalized current for the region Ri equivalent problem, 


as 


Cj(r) = {JiW.MiCr)} 


(9) 

where 

N Tj 

T« 




w 

II 

£ 

( r j ffnv 5 tfn v ) Rnv) 


(10) 


n=l 

v-l 




N 

T n 




Mi(r) = £ 

In £ ( r > ffn v > tfny ? Rtiv ) 


(11) 


ti=Ntj+1 

0=1 




S = source contribution coefficient = | 

Rnv — Ri 
otherwise 



With the set of basis functions in (4-8), one may apply the boundary conditions of tangential 
field continuity at each sub-domain of the basis functions. By merely applying the boundary 
conditions, however, the total number of equations may be greater than the number of the 
unknowns because of the multiplicity of some unknowns related to junctions. The usual methods 
of solving equations apply only when the number of equations equals to the number of unknowns, 
N. Such a set of N equations can be obtained by taking the n integral equation as the set 
of simultaneous integral equations (or summation of them) which satisfy the proper boundary 
conditions on the subdomains of the basis functions (B nv ,t; = related to the unknown 

coefficient, I n . It is possible to get such a surface integral equation system by testing with the 
generalized testing functions as follows 


Nr T m 

£< E ?( c *)ltan>£ C T »> 
1=1 u =1 

Nr T m 

£( lii x Hf(Cj), £CiT^> 


-^lltan? £ )> 

i=l t4=l 


m = 1,2, 


Nr , Tm 

— £( rii x Hj, £<C T ™*>> 

X=1 U=1 

m = Ntj 4 - 1 ,... } Ntj + N Tm , 


where, 


(f,g> 
C i 

Rm^ 


[ f ■ g ds 
Js 

field contribution coefficient = 
region of the testing function, , 



Ri — Rmu 
otherwise 




( 12 ) 


(13) 
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Fig. 4. A test case, (a) PEC alone (b) with phantom dielectric. 


and (Ef, Hf) and (E-, H\) are the scattered fields due to C, and incident fields, respective¬ 
ly. Equations (12) and (13) are the E-PMCHW formulation [ 6 ] extended to general junctions. 
Substituting C* into ( 12 ) and (13), the impedance matrix and excitation vector elements in (3), 
Zmn J and Vm , for example, are expressed as 

Nr Tn Tm 

%mn j = ( [®< (X* ( r i ffth> > tfnv 3 ^rO)]tan> X^ (*“3 //m* > tfmu j ^m«) )> 

t—1 T7=l «—1 

n = l,...,N Tj and m = 1,(14) 

Nr . Tm 

Vm == — XI ( ^iltan? ^m..i^mti( r 3 //t7W3 tflTLul )i TTl — lj •••) Ntj (15) 

i=1 «=1 


Some subroutines of EMPACK [11] have been used for the integrations over the triangular do¬ 
mains which appear in (14) implicitly. 


III. Numerical Results 

A T-shape junction of three 0.1-m wide and 0.3-m long PEC strips is taken as an example. For 
comparison a semi-circular cylinder of phantom dielectric having 0.1-m height and 0.3-m radius 
is attached to the T-shape junction as shown in Fig. 4. The ^-directed surface currents along 
the contour lines, (—0.3,0,0) -»(0.3,0,0) and (0,0,0) -> (0, -0.3,0), located at the center of each 
strip are computed for a plane wave excitation. The plane wave is expressed as E mc = £ 0 
where, k i = -x cos <£* sin O' — y sin sin 6 l —z cos 0*, E a —E' d (x cos 0* cos 4> l +y cos 0* sin </>* — z sin 0*), 
0* = = 45°, El = 1, k 0 = 27rf y /jl 0 e 0 , and / — 300 MHz. The results in Fig. 5 show very good 

agreement as well as the expected singularities at the end of the strips. 


IV. Conclusion 

A systematic procedure for modeling of the general junctions of any combination of conducting 
and/or dielectric bodies in an SIE/MoM formulation has been presented. With the successful 
modeling of general junctions, it is possible to apply the E-PMCHW formulation to a large class 
of problems including dielectric resonator antennas of complex configuration. 
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Fig. 5. z-directed current densities along the contours (o ; PEC alone, x ; with phantom dielectric). The 
arrows denote the start of the second contour. 
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1 Introduction 

Recently, higher-order numerical methods have received intense attention in computational elec¬ 
tromagnetics [l]-[4] because of their ability to represent fields/currents and model geometries more 
accurately than conventional low-order methods. Among the present higher-order methods, one 
class uses higher-order basis functions in projection methods such as the finite element method 
(FEM) and Galerkin-based method of moments (MoM) [l]-[3]. The other class uses the Nystrom 
approach with high-order discretization [4]. 

In the first class of higher-order methods, the basis functions used to expand the unknown 
surface current have an important property in that their normal components are always continuous 
across the sides of triangular patches representing the surface of the object to be analyzed. This 
property precludes the infinite accumulation of electric charges along the patch sides and thus per¬ 
mits the use of Galerkin’s method in the solution of the integral equation for the surface current. 
Among the various efforts on the development of higher-order basis functions, Wandzura [1] con¬ 
structed a hierarchy of higher-order functions, in which the well-known Rao-Wilton-Glisson (RWG) 
function [5] is the natural first member. Graglia et al [2] developed a set of fully interpolatory 
higher-order vector basis functions of the Nedelec type [6] in a unified manner, also based on the 
RWG function. 

In the second class of higher-order methods, the unknown surface currents are represented by 
their samples at a set of discrete points on the surface of the object [4]. The Nystrom approach is 
then employed to discretize the integral equation to be solved. Additional efforts are often required 
to deal with the singular integrals by using local correction techniques, which use either the LU 
decomposition or the singular value decomposition (SVD) to solve certain matrix equations. 

Comparing the two classes of higher-order methods discussed above, the first class produces 
more accurate solutions for the same number of unknowns. However, the evaluation of the integrals 
in the first class of methods is more time consuming because there are two fold integrations to be 
evaluated numerically. Additionally, the basis functions used in the first class require the mesh 
of triangular patches to be well connected: a side of a triangular patch has to be entirely shared 
by another patch and no vertices of a triangular patch can reside in the middle of its neighboring 
patch’s sides. This requirement is also found in the FEM. However, for many practical applications, 
this requirement is too stringent and it precludes the use of defective meshes. 

This paper proposes a set of novel, grid-robust, higher-order vector basis functions for the 
MoM solution of integral equations for three-dimensional (3D) electromagnetic problems. These 
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basis functions are defined over curvilinear triangular patches and represent the unknown electric 
current density within each patch using the Lagrange interpolation polynomials. The highlight of 
these basis functions is that the Lagrange interpolation points are chosen to be the same as the 
nodes of the well-developed Gaussian quadratures. As a result, the evaluation of the integrals in 
the MoM is greatly simplified. Additionally, the surface of an object to be analyzed can be easily 
meshed because the new basis functions do not require the side of a triangular patch to be entirely 
shared by another triangular patch. The proposed basis functions are implemented with point¬ 
matching for the MoM solution of the electric-field integral equation (EFIE), the magnetic-field 
integral equation (MFIE), and the combined-field integral equation (CFIE). Numerical examples 
are presented to demonstrate the higher-order convergence and the grid robustness for defective 
meshes using the new basis functions. 

2 Vector Basis Functions 

Given an object, its surface can be meshed into curvilinear trianglular patches. The current on the 
surface can then be represented as a summation of the currents on each of the patches: 

J(r) = £j p (r) (1) 

P~ 1 

where J p (r) denotes the current on the pth patch. The J p (r) can be interpolated by the Lagrange 
interpolator: 

h _ 

Jp( r ) = £-k(i,p)( r ) ' Jp( r i) (2) 

where r,- (i = 1,2, ..., I p ) are the interpolation points and L(j >p )(r) is called the interpolator with 
the property that 

i»(i,p) ( r i) ~ (3) 

where is the Kronecker delta function and I is the unit dyadic. Equation (2) can also be 
interpreted as the expansion of the current using L( ijP )(r) as the basis functions and J p (r*) as the 
unknown expansion coefficients, which are to be determined. 

With the property in Equation (3), the evaluation of the surface integrals in the MoM can be 
greatly simplified. For example, consider a surface integral over a triangular patch 

[ f(ry)L M (r')-J p (ri)dS' (4) 

j&p 

where /(r, r') is the kernel, which may be singular at r = r'. When the point r is not close to the 
point r' in triangular patch p, the Gaussian quadrature yields 

/ 

^U) s (r s )/(r,r s )L (i ,p)(r s ) ■ Jp(rj) (5) 

s=l 

where / is the number of points in the quadrature rule and w s denote the weighting coefficients. 
If the quadrature points are the same as the interpolation points in the Lagrange polynomial, 
Equation (3) can be used to give 

u>i(ri)f(r,ri)I- J p (n) (6) 

which greatly simplifies the result of integration. 
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V 



r-space (k,v) space 


Figure 1: A curvilinear triangular patch in the r space mapped to a planar right-angle triangle in 
the (u, v) space. 

To design the basis functions L^ p )(r), we first map a curvilinear triangular patch in the r space 
into the (u, u) space, as shown in Figure 1. Since the surface current has two degrees of freedom, 
we can write any surface current in the (w, u) space as 

J = -‘-[J !I u +J^v] (7) 

Jac 


u du' v dv 


in which 


Jac = V 911922 — 912921 


9 11 = u • u, q 12 = 92 \ - v ■ u, 922 = v • v. 


From Equation (7), the current at any point on the patch can be written as 

J i = y— [JiuUi + 

Jaci 

The current J can then be interpolated by the Lagrange interpolator as 

i P ^ 

J = ~T~ Li(u,v)[Ji u \ii + J iv Vi] 

1 Jaci 

where Li(u,v) is a two-dimensional Lagrange interpolator with the property that 


Li(uj,Vj) — 5{j. 


Equation (12) can also be written as 


p 1 

j = 5Z T~ Li ^ v )t u ta22Ui - 9il2Vi) + v(9in\i - 9i2lUi)] • J i 
j =1 Jaci 

h _ 

= ■ J i. 

i-l 
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Table 1: Information about the vector basis functions and the chosen quadrature rules. 


Order of 
basis functions 

Number of 
interpolation points 

Accuracy of 
quadrature rule 

Unknowns in 
each patch 

0 

1 

1 

2 

1 

3 

2 

6 

2 

6 

4 

12 

3 

12 

7 

24 


It then remains to design Lj(u, v) in the r space to satisfy Equation (3) or Li{u,v) in the (u, v) 
space to satisfy Equation (13). In addition, we also hope the Lagrange interpolation polynomials 
Li(u,v) to have the following properties. First, the interpolation points are distributed inside 
a triangular patch. Second, the interpolation points are the same as the nodes of a Gaussian 
quadrature rule. Third, the chosen interpolation points should give the highest degree of accuracy 
among the quadrature rules that have the same number of interpolation points. 

The three properties above provide a guideline to choose the Gaussian quadratures. Once the 
quadrature points are chosen, one can proceed to find the explicit expression of Li(u,v). In the 
(u, v) space, we can define the space of n-degree polynomials [7] as 

P% — span-fuV 7 ; i, j < n} (15) 

such that the dimension of the space is 

dimPn = Cn-t-2 = ( 16 ) 

For n — 1, dimP^ = 3, and P% = span{l,u,u}, and in this case, we choose the interpolation points 
from the three-point quadrature rules. For n = 2, dimP^ = 6, and P% = span{l,u, u 2 , uu,v 2 }, 
and consequently, we choose the interpolation points from the six-point quadrature rules. This can 
be extended to the higher-order ones in a straightforward manner. 

Once the polynomials P 2 are chosen, L{{u 1 v) can be determined by solving the matrix equation 


-Pl(ui,Ul) P\{U2,V2) * 
P2{u 2 ,V2) • 

♦ Pl{u n ,V n ) 

P2(lift)U n ) 


Li(u,v) ‘ 
L 2 {u,v) 


Pi(u,v) 

p2{u i v) 

_Pn(u P n (u 2i V 2 ) ’ 

•• Pn(Un)U n ) 


L n (u,v) _ 


P n {u,v) _ 


where (w t , Vf) denote the interpolation points. If the Vandermonde determinant of the above equa¬ 
tion is nontrivial, Li(u,v) can be obtained uniquely. Therefore, an additional requirement for the 
interpolation points is that they should guarantee \VDM\ £ 0. 

Table 1 lists the information about the zeroth-, first-, second-, and third-order vector basis 
functions that are designed based on the procedure described above. The zeroth-order basis func¬ 
tion is the same as the well-known pulse function. For the zeroth-, first-, and second-order basis 
functions, we choose the quadrature rules [8] whose number of points matches the degrees of poly¬ 
nomials, dimP 2 . For the third-order basis functions, we find that the available 10-point Gaussian 
quadratures [9], [10] yield an ill-conditioned matrix in Equation (17). Hence, we choose to use the 
Gaussian quadrature with 12 points [11], which yields a well-conditioned matrix. In addition, the 
12-point Gaussian quadrature rule has the 7th degree of accuracy while the aforementioned 10-point 
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Figure 2: RMS error in the bistatic RCS versus the number of unknowns per square wavelength 
obtained using the CFIE. The solid lines correspond to a = 0.2 and the dash lines correspond to 
a = 0.5. 

rules have only the 4th or 5th degree of accuracy. This choice yields a little more unknowns, but it 
also gives a higher integration accuracy. For the third order, the polynomial space is spanned by 

{1, u, v , u 2 , uu, v 2 , u 3 , u 2 i>, uv 2 , try 3 }. (18) 

3 Numerical Results 

We have successfully implemented the new basis functions in the MoM solution of the EFIE, MFIE, 
and CFIE using the point-matching approach. The EFIE and MFIE are given by 

t • ^ G(r,r') • 3(r-)dS' = -t • E inc (19) 

and 

^-^nxVx| S (r, r')J(r ')dS' = n x H inc (r) (20) 

respectively, where G(r,r') is the dyadic Green function, <?(r,r') is the scalar Green function, t 
and n denote the tangential and normal unit vectors on the surface s, respectively, and finally 
(EfncjHinc) denote the incident fields. The CFIE is simply a linear combination of EFIE and 
MFIE: 

aEFIE + (1 — a)?7MFIE (21) 

where a denotes the combination constant, usually chosen from 0.2 to 0.8, and r\ is the free-space 
wave impedance. For the singular integrals in the EFIE and MFIE, Duffy’s transformation [12] is 
used for their evaluation. 

A conducting sphere having a diameter of 1 m is used to test the MoM solution using the 
new higher-order vector basis functions. The parameter used in the study is the root-mean-square 
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(a) (b) 


Figure 3: (a) An inscribed regular octahedron, (b) The front half of the octahedron is rotated by 
an angle of A<j>. 

(RMS) error defined as 



where cr ca i denotes the calculated radar cross section (RCS) and cr Te f denotes the reference Mie 
solution, both measured in dB, and N s is the number of sampling points, which are the angles of 
observation here. 

Figure 2 shows the convergence behavior of the higher-order basis functions obtained at 0.3 
GHz. It is clear that when the number of unknowns increases, the results obtained using higher- 
order basis functions converge more quickly than those of lower-order ones. This is particularly 
true when a high solution accuracy is desired. 

Next, the proposed basis functions axe validated on defective meshes. Here, a defective mesh 
refers to one in which an edge of a triangular patch is partially, instead of totally, shared by another 
triangular patch, or in other words, two neighboring triangular patches partially share one edge. 
The validation is carried out on a 1-A diameter sphere using the MFIE with the third-order basis 
functions. By mapping the triangles of an inscribed regular octahedron onto the surface of the 
sphere, we obtain an exact description of the sphere with well-connected curvilinear triangular 
patches, as illustrated in Figure 3(a). If we rotate the front half of the octahedron by an angle of 
A <p and then map the eight triangles onto the surface of the sphere, we obtain a defective mesh, 
which also describes the sphere exactly, as shown in Figure 3(b). By subdividing each of the eight 
triangles into four subtriangles, we generate a mesh with 32 triangular patches, which is used for 
the study here. Figure 4 shows the RMS error as a function of the rotating angle A <f>. It is seen 
that the RMS errors from a series of defective meshes are within ±0.004 dB of the RMS error of 
the well-connected mesh, which is 0.013 dB. This demonstrates clearly the grid-robustness of the 
proposed new higher-order basis functions. 
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Figure 4: RMS error in the bistatic RCS versus the angle of rotation. 


4 Conclusion 

A set of novel, grid-robust, higher-order vector basis functions was proposed for the MoM solution 
of integral equations for 3D electromagnetic problems. These basis functions were defined over 
curvilinear triangular patches and based the Lagrange interpolation polynomials. The Lagrange 
interpolation points were chosen to be the same as the nodes of the Gaussian quadratures, which 
greatly simplified the evaluation of the integrals in the MoM. The new basis functions were also 
applicable to defective meshes where the side of a triangular patch is only partially shared by 
another triangular patch. The proposed basis functions were implemented with point-matching for 
the MoM solution of the EFIE, the MFIE, and the CFIE. Numerical examples were presented to 
demonstrate the higher-order convergence and the grid robustness for defective meshes. 
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1 The microstrip problem. 

The current distribution on a microstrip structure radiates an electric field given by Pocklington’s integral equation 

E(f)=jup o JJ G{r-, ds' 

In order to solve Pocklington’s equation for the current density J s , the method of moments (MoM for short) is 
employed. MoM solves this equation dividing the structure into M different segments called patchs. The unknown 
current distribution, J s , is written as a linear combination of two-dimensional basis functions which are defined 
over rectangular segments. 


m M 

T. = y')x + £ Iy,J yi {x',l/)y (1) 

1=1 1=1 

Basis functions can be written as the product of two independent variable functions; J x (x,y) = T(x)Q(y) and 
J y [x, y) s= Q{x)T(y). Usually, the longitudinal component has a piecewise sinusoidal or triangular variation and the 
tranversal component has a constant distribution. So, the shape of one of these basis functions can be something 
like figure 1. The components of the diadic Green’s function can be written using G x and G v Sommerfeld integrals. 



Figure 1: Basis function. 
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gb “ = 

k ° G * + h Gv ] 

Ge ‘* = 2nk$ 

afe®'] = Ge 

° E »’ = 

**■+&*] 


Projecting the electric field along the x and y axis and using Galerkin’s method, Pocklington’s equation will reduce 
to a matrix equation: 

(*&l i = l..M 

\Zyx Zyy) 2 Mx2M Vi' 2Mxl / 2Mxl ^ 

The elements of the matrix [Z] are integrals of the following type. 

r*j+hj pzi+hi ryi+u>2i fyj+wij 

/ dx' dx' dy' I F i (x , ‘,y > ) Fj{x;y) G(x - x',y - y')dy (6) 

Jxi—ltj dxi—lu Jyi->Jyj—wn 

where G is the Green’s function of in a grounded dielectric slab and Fi(x',y') and Fj(x,y) are basis or weight 
functions. 

Table 1 shows the value of these functions for each integral. This kind of integrals are defined over rectangular 
shapes. Since the functions Fi and Fj are defined within patches , the couple between patches i and j has to be 


Integral 1 


#(*,</) Fj{x\y 


Integral 2 




G v (x,y;x' y tf) 


Table 1: [Z] functions. 

computed by parts. This calculation will be accomplished through the sum of the couplings of each one of the 
subdomains in which is divided patch i with each one of the subdomains of patch j. 

Terms V x and Vj are obtained using a the delta-gap voltage excitation model. This model assumes the excitation 
is accomplished through a voltage source applied within a infinitesimally small gap. Terms [Z] are quadruple 
integrals that can be directly calculated using a Gauss-Laguerre quadrature but their cost would be too large. To 
minimize this cost, a domain change transforming (6) to a double sum of double integrals is accomplished. Using 
this transformation, the calculation time reduces considerably. 
rxj+hj r*i+h, rVi+viyi ryj+W2j 


rxj+hj rm+ht ryi+vi2i ryj+v>2j 

/ dx' dx' dy' Fi{x'\y') Fj(x\y) G{x - x\y - y')dy 

Jzi—lij Jyi—wi, Jyj—ivu 

12 12 .ULS l rVLS k 

= du /m (u) • far* (v) ■ G{y/2u + Xj - x { , + yj- yi)dv 

l=i k=i Julj l Jvu k 


The integration limits in (7) are different for each integral and once (7) has been obtained, the elements of the 
matrix [Z] are numerically calculated. 

2 Behaviour of Green’s functions. 


As shown in 2, 3 and 4, the components of the diadic Green’s function can be written using two auxiliar functions; 
G x and G v . These two functions are also known as Sommerfeld integrals or Hankel transforms and their analytical 
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expressions are: 


g* = ( 8 ) 

JO -Ute 

G, = f" jr^r-ZMPOdt. (9) 

JO Ute-Lstm 

for a substrate of height d, dielectric constant e r and losses factor tan S. 

D te = *2 - jh cot(kid) D tm = c t k 2 +jk x tan^d) N = k 2 + j&i tan(JM) 

^2 = \Ao ~ 4 2 = ^o 2 - £ 2 = e r (l - i tan J) 

The behaviour of the integrands in Gx and G v when f ->• oo is something like Foo{£) = Yln= o£ nc n which 
corresponds to a Laurent series expansion of the integrand at £ -* oo taken along the real axis (there is an 
essential singularity at oo). These terms can be integrated analytically and the asymptotic terms are in the form 
5°°(p) = £n=o pc. By extracting the asymptotic value of the integral, the singularity due to zero radial distance 
has been removed and can be analytically treated. For instance, let’s go and see Sommerfeld integrals with and 
without the zero radial distance singularity (figures 2 and 3). Substrate parameters are e r = 12.8, h = 0.12Ao and 
tan S = 0.01 




(a) With singularity 


(b) Without singularity 


Figure 2: G v with and without singularity. 


3 Computing [Z] terms. 

Acording to the latter section, G x and G v can be written as G a = ^ + G smoot h so Z terms in (7) can be divided 
into two terms. Depending on the kind of basis and testing functions, the contribution of the singularity to the 
integral can be analytically computed. The other term can be calculated using a gaussian like cuadrature. 

In general, one chooses as basis functions the set of functions that have the ability to accurately represent the 
unknown function while minimizing the computational effort requiered to employ it. 
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(a) With singularity 


(b) Without singularity 


Figure 3: G x with and without singularity. 


The unknown current distribution can be represented using piecewise sinusoids as: 

if Xi — hi <x'< X{ 


TiW) = 




sin[fc e (*i« + x‘ - z t -)l 
sin (Ml*) 

sin [k e (hi - x' 4- a;,-)] 
sin (M 2 *) 

0 
1 

0 


if X{ < x* < + hi 

elsewhere. 

si yi - ton <it < yi + w 2 i 
elsewhere. 


( 10 ) 


( 11 ) 


where k e is the propagation effective constant; usually k e = — 2 Z , r . This kind of model is very good but it makes 

Ao\/ 3- 

expressions like (7) very expensive to compute and the singular term cannot be analytically calculated. 

On the other hand, one can use picewise linear functions to represent the current. 


Ti(x') = 


( hi + . x - Xi if Xi — hi < X f < Xi 

ki if Xi < x' < X{ -f hi 


W) = | 


elsewhere. 


wu + W 2 i 

0 


si yi - wu <j/ < yi+ w 2 i 
elsewhere. 


( 12 ) 


(13) 


This model isn’t as good as the piecewise sinusoid but it has two advantages over it. First it’s faster and simpler. 
Second the contribution of the singular term can be analytically computed. Using this model, one can use a simpler 
numerical cuadrature (vg. four points instead of eight) when computing terms with singularities (overlaped patchs) 
because the main contribution to (7) comes from an analytical formula. No numerical integration is needed to 
include the singular term. 
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4 Results. 


In this section, we are going to show some results obtained analysing two microstrip structures using four different 
schemes. 

• Piecewise sinusoids for basis and testing functions and an eight point gaussian quadrature to compute the 
numerical terms of (7). 

• Piecewise lineal for basis and testing functions and an eight point gaussian quadrature to compute the 
numerical terms of (7). No analytical integral computed for the singular term. 

• Piecewise lineal for basis and testing functions and a four point gaussian quadrature to compute the numerical 
terms of (7). No analytical integral computed for the singular term. 

• Piecewise lineal for basis and testing functions and a four point gaussian quadrature to compute the numerical 
terms of (7). Singular term is analytically computed. 


4.1 Microstrip hybrid. 

The geometry of this structure is shown in figure (4) and it was meshed using 56 overlapped rectangular patches. 



Figure 4: Hybrid geometry. 


One prototype was built and measured and the results are shown in figure (5). 



(a) Sll and S21 


(b) S31 and S41 


Figure 5: Hybrid measurements. 
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Figure 10: Calculated parameters. 


5 Conclusions. 

Table (2) shows timing for the computed simulations measured in seconds per frecuency. Simulations were made 
in a Pentium III 450 MHz machine. As one can see, piecewise lineal model with four points including analytical 
singular terms is, as least, as good as piecewise linear with eight points and 3.7 times faster. Piecewise lineal 
with four points without singularities is clearly bad and very inestable. Piecewise sinusoid is better than the other 
models simulating the meander because it’s well suited for modeling current distributions along narrow strips. On 
the other hand it’s eight times slower. 


Structure 

Sinusoid 

Lin 8pt 

Lin 4pt w/o 

Lin 4pt with 

Meander 

128 

56 

15 

15 

Hybrid 

256 

118 

31 

31 


Table 2: Simulation timing. 
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Abstract 

A higher-order method of moments is presented for the analysis of microstrip structures in a 
multilayer medium. It is shown that this higher-order method has a better convergence rate than 
the conventional lower-order methods. Several examples are given to demonstrate the accuracy and 
efficiency of this method. 

1 Introduction 

Full-wave electromagnetic modeling of microstrip structures has become essential in the analysis 
and design of microstrip antennas and circuits. A variety of methods having been developed for 
this purpose, which can be divided into two broad classes: the differential-equation based approach 
and the integral-equation based approach. For microstrip structures in a multilayer medium, the 
integral-equation based method of moments (MoM) is more attractive since the method allows us 
to formulate the problem in terms of surface integrals instead of volume integrals. 

A critical factor for an efficient and accurate MoM analysis is the choice of basis functions. 
Traditional numerical modeling employs roof-top functions for rectangular discretization or Rao- 
Wilton-Glisson (RWG) functions [1] for triangular discretization. These functions are complete 
to the zeroth order. As a result, a very fine discretization is often required to yield an accurate 
solution. This leads to a large matrix equation, which is expensive to solve. In addition, the 
numerical solution converges slowly to the exact one when the discretization is made finer. A 
solution to this problem is to employ higher-order basis functions, which have a better convergence 
rate and can yield an accurate solution with a rather coarse discretization. The better convergence 
of higher-order basis functions has been demonstrated previously in the spectral method [2] and 
the p-version finite element method [3]. In this paper, we present an MoM using higher-order 
interpolatory basis functions for electromagnetic modeling of microstrip structures in a multilayer 
medium. 

2 Formulation 

Consider a microstrip structure in a multilayer medium. A well-known integral equation for this 
problem [4] is given by 

nx [-ja. /i0 (G >1 (r,r'),J(r)> + —V(G $ (r,r'),V'-J(r'))l = -n x E“(r) (1) 

l 0 J 
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where J denotes the electric current density on the microstrip and E° is the applied field, which is 
the incident field in the absence of the microstrip for scattering problems or a delta gap source for 
circuit problems. The G A and G 4, are the Green’s functions for the vector and scalar potentials, 
respectively. In general, the Green’s function for a multilayer medium can be written as 

1 C°° - 

G( Pi z\z') = — jf G(k Pi z\^)Mk p p)kp^k p (2) 

where G is the spectral domain counterpart of G, which can be derived analytically [5]. To cir¬ 
cumvent the time-consuming numerical integration, we employ the complex image method [6]-[10], 
which yields a closed-form expression. 

To apply the MoM, the microstrip is discretized into triangular elements and the unknown 
current is expanded by a set of vector basis functions on triangular elements 

N 

J(r) = £ T„A n (r) ( 3 ) 

71=1 

where A n is the basis function and N is the number of unknowns. 

For each triangular element (Fig. 1), we define a set of area coordinates (£i,£ 2 >£ 3 )j where & 
vaires linearly across the element, reaching one at the ith vertex and zero along the opposite edge, 
which is defined as the ith edge. The well-known zeroth-order basis functions are given by 

Ai(r) = - 6_i4+i), i = 1,2,3 (4) 

where A is the area of the element and U ±i are constant vectors defined along the (i ± l)th edges. 
Note that n* * A; is constant on edge i and A is linearly varying along edge i — 1 and t +1, as shown 
in Fig. 2. Also noting that 

V£ = n4, (5) 

we find the divergence of A as 

V • A = j. (6) 

The zeroth-order basis function is also known as the RWG basis function [1]. 

The higher-order interpolator vector basis functions on a given triangular element are con¬ 
structed by multiplying the zeroth-order basis functions with a set of polynomial functions [11] 

Ag.*(r) = ATff (P + 2 y^ K) A^(r) (7) 

where (3 denotes the edge number associated with the zeroth-order basis function, i,j, k are the 
indexes for labeling the interpolation points, which satisfy i + j + k = p + 2, and iff takes i.j ,or k 
for /? = 1,2, or 3, respectively. The normalization coefficients N^ k are given by 


The is the polynomial function defined in terms of shifted Silvester-Lagrange polynomials 

R as 


— Riip + 2 ^\)Rj{p + 2 , £2)^ (p + 2,^3) 


(9) 
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where the shifted Silvester-Lagrange polynomial is given by 


R>(p,(,) = 


{ 


pijrntiw-fc), 

i, 


2<i<p + l 

i = 1. 


( 10 ) 


The number of degrees of freedom is (p+ l)(p + 3) on a triangular element for the basis functions 
of order p. The first-order basis functions so obtained are shown in Fig. 3. 

Substituting (3) into (1) and applying Galerkin’s procedures, we obtain a matrix equation 


ZI = V. 


(ID 


For calculating the matrix elements, the double surface integrals are evaluated using Gaussian 
quadrature when the two elements do not overlap. When they do, the method proposed by Duffy 
[12] is employed to evaluate the singular integral. The solution of this matrix equation gives the 
current distributions on the conductor surface, from which the radar cross section (RCS) or S- 
parameters can be extracted [13]. 



Figure 1: Notations of a triangle. 



Figure 2: The zeroth-order basis function. 


3 Numerical Results 

The convergence behavior of the higher-order MoM is first analyzed using a microstrip patch 
antenna as an example. The patch is 40 mm by 40 mm, which resides on a substrate with relative 
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(a) (b) 

Figure 4: Convergence behavior of high-order basis function, (a) Relative error versus number of 
unknowns, (b) CPU time versus number of unknowns. 


permittivity 2.17 and thickness 1.58 mm. The incident plane wave is assumed to be 9 polarized 













Figure 5: Backscattered RCS (ae$) versus frequency for a rectangular microstrip antenna [14]. 


with the incident angle ( 0 z ,0 l ) = (60°,45°). The frequency is 10 GHz. The 9 component of 
backscattered RCS is calculated. The simulation is carried out at various levels of discretization for 
basis functions with different orders. The error is plotted in Fig. 4(a) by computing |cr — <7 r ef\/Wref\ 
where a re f is the referenece RCS result by running the third-order scheme at a discretization of 
400 triangles. The corresponding CPU time for matrix filling is given in Fig. 4(b). From the 
figures, we observe that for the same number of unknowns, the higher-order scheme gives more 
accurate results and the CPU time are comparable for different order schemes. For small problems, 
the higher-order schemes has more CPU time because the singular and near interaction terms are 
relatively dominant. When the problem size becomes large, the higher-order schemes become be 
more efficient than the lower-order ones. 

As the first example, we calculate the backscattered RCS of a microstrip patch antenna [14]. All 
the parameters are the same as those in the previous convergence analysis except for the patch size. 
Here, the size is 36 mm by 26.6 mm. The zeroth-, first-, and second-order schemes are employed 
for a coarse discretization with 24 triangles. As can be seen from Fig. 5, the zeroth-order scheme 
does not give an accurate result, especially at high frequencies, whereas the first- and second-order 
schemes converge to the accurate result. The numbers of unknowns for the zeroth-, first-, and 
second-order approaches are 29, 106, and 231, respectively. 

The next example is a crossed dipole antenna in a double-layer medium [10], [15]. The two 
crossed dipoles are on the top layer, which have different lengths so that the dual frequency oper¬ 
ation is achieved. One is 11.9 mm long and the other is 10.2 mm long. Both of them are 1.7 mm 
wide. The feeding lines are on the bottom layer, which are 2.2 mm wide. The substrates have the 
same relative permittivity 2.17. The bottom layer is 0.8 mm thick and the top one is 1.6 mm thick. 
The coarse discretization is shown in Fig. 6(a). Different order basis functions are employed. The 
S-parameters are plotted in Fig. 6(b)-(d), which show that the higher-order basis functions give 
more accurate results. The numbers of unknowns for the zeroth-, first-, second-, and third-order 
approaches are 61, 246, 555, and 988, respectively. 

The last example is a microstrip filter [16]. The photonic bandgap effect can be achieved 
from this structure. The coarse discretization is shown in Fig. 7(a). The zeroth- and first-order 
approaches are employed here, which have 347 and 1274 unknowns, respectively. The S-parameters 
are given by Fig. 7(b) and (c). 
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(c) (d) 

Figure 6: The S-parameters of a crossed dipole antenna [10], [15]. 

4 Conclusion 

A higher-order method of moments is presented for the analysis of microstrip structures in a 
multilayer medium. It is shown that this higher-order method has a better convergence rate than 
the conventional lower-order methods. Several examples are given to demonstrate the accuracy and 
efficiency of this method. 
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(a) 



(b) (C) 


Figure 7: The photonic bandgap effect of a microstrip filter [16]. 
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Abstract: 

Certain applications are constrained in available surface 
area for the location of multiple antennas. A solution to 
this was developed and tested that is both conformal and 
flexible with multifunction capability. The antenna has 
capabilities in the VHF, UHF and L-Band frequency bands. It 
has both transmit and receive capability in both VHF and 
UHF and receive only for L-Band. The VHF portion handles 50 
Watts CW and the UHF capability is 200 Watts CW. The 
antenna described has 3.0 inches of flexure with a 19 inch 
square aperture in a 3.75 inch overall height. The UHF 
portion has both low and high angle modes. This antenna 
required a new method of installation and removal from 
arbitrary curved surfaces. Successful flight testing has 
occurred. Several modifications were made and limited data 
is presented illustrating its multifunction capability. 

Introduction: 

The typical antenna farm installation consists of a single 
antenna for every function and frequency band of interest. 
This can quickly result in a lack of available surface area 
for antenna installation. Typical solutions call for 
deletion of antennas or combinations of antennas.[1] .The 
problem is further aggravated by potential transmitter 
coupling, maintaining required field of view and meeting 
single antenna performance in a multifunction aperture. 
Mating conformal antenna(s) to an arbitrary surface is 
still a major problem. This solution requires a pliant or 
flexible antenna with multifunction capability. This has 
been investigated by several groups with solutions such as 
the flexible tape array that is minimally flat with a small 
amount of allowable bending. [ 2 ] A second antenna solution 
is the flexible cushion that is laid on any available 
surface. [3] These antennas are all limited to a single 
antenna function. The effort reported here resulted in a 
solution whereby antenna apertures are combined in a single 
aperture. 
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Antenna Requirements: 

The antenna was designed for installation on a multitude of 
different shaped surfaces. Figure 1 defines the antenna and 
the localized directions for installation. The antenna has 
multiple communication capability in the VHF and UHF bands 
and receive only capability for GPS LI and L2 frequencies. 
The requirements are summarized in Table I. 

Table I. Conformal, Flexible Multifunction Antenna 
Requirements 

Frequency Bands: VHF - 118 to 174 MHz 
UHF - 240 to 320 MHZ 
GPS - LI and L2 

Polarization: VHF - Vertical 

UHF (Mode 0) - RHCP 
UHF (mode 1) - Vertical 
GPS - RHCP 

Azimuthal Coverage: 360 degrees 

Elevation Coverage: VHF - Standard Monopole 

UHF (mode 0) - Upper Hemisphere 
UHF (mode 1) - Standard Monopole 
GPS - Upper Hemisphere 

Gain: VHF - 0 dBli 

UHF (mode 0) - +3.5 dBic 
UHF (mode 1) - +3 dBli 
GPS - +3 dBic 

VSWR: VHF - 2.0:1 with 2.5:1 maximum 

UHF - 2.0:1 with 2.5:1 maximum 
GPS - 2.0:1 maximum 

Power Handling: VHF - 50 Watts CW 
UHF - 200 Watts CW 
GPS - Receive Only 

Pressure: 2.0 psi (4.0 psi ultimate) on Edges 

10.0 psi (20.0 psi ultimate) on Top Surface 

Weight: Less Than 25 Pounds 

Size: 19 inches Square, 3.75 inches in Height 
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Solution: 


The antenna solution is shown in figure 2. The antenna is 
extremely pliant with built-in flexure from fore to aft. 

The antenna weighed 20 pounds and met all power handling 
and required frequency band operation. Every requirement 
from Table I was met except for UHF mode 1 that requires a 
feed modification. However, it is noted that mode 1 
performance is a goal. 

The antenna solution uses four point feeding of an air 
loaded patch antenna for UHF.[4] VHF is handled with a 
broadband matched whip antenna inclined at 30 degrees aft. 
GPS is handled with a high 'K' stacked patch antenna 
located slightly off centerline in the fore area of the VHF 
antenna. The antennas were all positioned on the UHF 
antenna for minimized coupling. The UHF antenna center is a 
zero field point that allowed the installation of all 
decoupling and matching components in a centrally located 
cylinder. 

Problems Discovered and Resolutions: 

The GPS antenna function operated flawlessly with no 
interactions with VHF or UHF communications due to a built- 
in filter and two octave frequency separation. However, VHF 
energy did couple to UHF communications due to close 
proximity and the minimal 66 MHz frequency separation. This 
coupling was fixed with a small amount of inductance and 
attenuation using a coaxial line matching transformer 
inserted at the VHF feedpoint. 

The UHF function also exhibited the problem of changing 
matching condition with flexure of the aperture and 
installation on the arbitrary curved surface. These 
problems were corrected with identical feed point match in 
four places that made flexure changes benign to antenna 
performance. 

The antenna is installed via an insulating layer of 
proprietary adhesive, that can be removed, providing for a 
quick install and de-install of the antenna. The UHF 
antenna was tuned with the insulating layer on a simulated 
ground plane providing maximum gain over frequency. This 
accounted for dramatic variations in gain with 
installation. Figure 3 shows the gain at UHF when the 
antenna is incorrectly installed. Figure 4 shows the gain 
at UHF when the antenna is correctly installed. 
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Measurements: 


Measurements were made on all functions. GPS was well 
within the required VSWR and gain. VHF exhibited a worst 
case 2.4:1 VSWR. UHF mode 0 is 1.5:1 VSWR maximum while 
mode 1 was 3.4:1 VSWR maximum. Radiation patterns were 
obtained and were as expected with no frequency dropouts or 
angular dropouts. Figure 5 shows a typical VHF radiation 
pattern. Figure 6 shows a typical set of un-normalized UHF 
mode 0 radiation patterns. A slight skewing was discovered 
in the UHF mode 0 function, but it changed slightly with 
installation and in all cases was acceptable. The UHF mode 
1 function radiation patterns are close to theory, but the 
gain is lower than expected. Gain at VHF is + 5 dBli and + 

3 dBic for GPS. UHF mode 0 has gain of + 3.5 dBic. 

Summary: 

A conformal, flexible multifunction communications antenna 
was developed, demonstrated and is still operating in the 
field. The antenna has been flown and can be installed and 
de-installed from a multitude of curved surfaces. All 
functions met the specifications in Table I except for UHF 
mode 1. A solution for this deficiency has been developed 
and is being presently pursued. 
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Abstract 

Accurate and efficient modeling of finite printed antenna arrays is of increasing practical 
importance. This paper presents an adaptive multi-resolution approach for performing such 
modeling. Its merits for single elements are demonstrated and the extension to finite arrays is 
discussed. 


1 Introduction 

Multi-functionality, broad bandwidth and size requirements are important issues in printed antenna 
design. Printed antennas meeting such demands often incorporate complex materials and fine 
features as part of their construction and hence part of these antennas often require a very high 
resolution for proper numerical modeling. High resolution is also needed for accurate feed modeling 
and input impedance computations. Accurate modeling of finite arrays of such multi-function or 
broadband elements presents additional challenges. The demands for high resolution result in an 
excessive number of unknowns which cannot easily be handled using traditional numerical methods, 
even when fast integral or hybrid techniques are employed. 

In this paper, we discuss a new approach for accurate and efficient modeling of finite printed an¬ 
tenna arrays which may incorporate fine features. To overcome the difficulties in modeling the feed 
region, fine geometric details and complex or dense materials, the hybrid finite element / boundary 
integral (PE/BI) method [1] is employed in conjunction with hierarchical mixed-order tangential 
vector finite elements (TVFEs) for tetrahedra [2], Such TVFEs avoid any need for re-meshing 
while allowing accurate modeling of rapid field variations provided the order of the approximation 
within each element is determined properly. This can be done adaptively which eliminates the need 
for human choices as part of the modeling process. To overcome the difficulties in modeling the 
finite nature of the array, we discuss an approach which is based on the fast spectral domain algo¬ 
rithm (FSDA) formulation for infinite periodic arrays [3] but employs a modified aperture Green’s 
function which accounts for edge effects in an approximate manner. This modification influences 
the matrix-vector products within the iterative solver algorithm and has a minimal effect on the 
central processing unit (CPU) time requirements for the iterative solver. Hence, the CPU time 
requirements for the proposed approach are comparable to those for FSDA infinite array analysis. 
However, although the proposed approach is not exact, it is more accurate than simply utilizing a 
truncated infinite array solution as the latter approach does not incorporate edge effects at all. 

This paper is organized as follows. Section 2 briefly outlines the adaptive refinement strategy 
adopted in the paper. Section 3 offers some numerical results demonstrating the merits of the 
proposed adaptive multi-resolution modeling approach for a single antenna element. Section 4 
discusses the proposed approach for finite antenna arrays. Section 5 concludes the paper. 
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2 Adaptive refinement strategies 

In this paper, a conceptually very simple adaptive refinement strategy is adopted. Following a 
FE/BI solution with mixed-order TVFEs of order 0.5, we determine the degree of error in each 
element via an error indicator. A refined solution is then computed where a certain pre-specified 
percentage of the elements having the highest degree of error are modeled with mixed-order TVFEs 
of order 1.5 and the remaining elements are again modeled with mixed-order TVFEs of order 0.5. 
Three different error indicators are presented below. 

For a general three-dimensional electromagnetic problem, we discretize the computational do¬ 
main into N c tetrahedral elements denoted by T e , e = 1, •• •, N e y each having 4 faces denoted by 
F/, t = 1,•" ,4. The center of T e and Ff is denoted by C(T e ) and C(Ff ), respectively, and the 
unit normal vector to Ff directed out of the element T e is denoted by n*. The lowest order FE/BI 
solution leads to an approximation of the electric flux density D within and on the boundary of 
each element T e . On the face Ff, we let denote the value of D evaluated in T e and let D t e,out 
denote the value of D evaluated in the element bounding T e . As discussed in [4], three suitable 
error indicators for an element e are 


Eh(e) = |D(C(n)l 

Eh(e) = rnax{|n? • (D*’ in - D'' oue )| C(i?) } 


n e fT"l e ’* n T-» e '°Ufc 



(1) 

1(7(1?)} 

(2) 

’° U, )| J <£S j 

(3) 


The corresponding error indicators for a sub-domain of elements are simply the maximum of the 
error indicators for the elements comprising the sub-domain. 


3 Single antenna element analysis 

We consider a metallic printed bowtie antenna backed by a rectangular cavity recessed in an infinite 
metallic ground plane, as illustrated in Fig. 1 (side view) and Fig. 2 (top view). The cavity-backed 
patch antenna is situated in free space characterized by the permittivity £o and the permeability /zo- 
The cavity dimensions are 48 cm x 64 cm x 12 cm. The interior metallic cavity walls are covered 
with an artificial absorber of permittivity (1 — j2.7)eo: permeability (1 — j2.7)fJto and thickness 4 
cm. This absorber layer would not be part of an actual antenna but is merely a well-established 
computational tool that serves to approximately simulate a metallic printed bowtie antenna situated 
in free space. The specific printed bowtie antenna consists of two isosceles triangular patches 
characterized by the opening angle 67.38° and the maximum width 24 cm. The bowtie patches are 
centered in the cavity aperture and fed by a probe of constant current connecting the two triangular 
patches. 

An antenna very similar to the above was discussed by Collin [5] and is expected to cover the 
UHF channels 14 to 83 spanning the frequency range [450 MHz, 900 MHz] w'hen used with a 300 ft 
feed line. That is, the real and imaginary parts of the input impedance are expected to hover 
around 300 ft and 0 ft, respectively, in this frequency range. 
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Figure 1: Side view of a metallic printed bowtie antenna backed by an air- and absorber-filled 
rectangular cavity recessed in an infinite metallic ground plane. 



Figure 2: Top view of a metallic printed bowtie antenna backed by an air- and absorber-filled 
rectangular cavity recessed in an infinite metallic ground plane. 


For analysis and evaluation of the proposed error indicators, we discretize the boundary integral 
(BI) surface and the patch into a coarse (56 antenna triangles and 330 BI triangles) and a fine (160 
antenna triangles and 1030 BI triangles) mesh. These surface meshes are extruded into the cavity 
to form three prism layers. Each prism is broken into three tetrahedra and four different TVFE 
options are applied. First, the mixed-order TVFE of order 0.5 throughout the cavity for the coarse 
mesh. Second, the mixed-order TVFE of order 0.5 throughout the cavity for the fine mesh. Third, 
the mixed-order TVFE of order 1.5 throughout the cavity for the coarse mesh. Fourth, the mixed- 
order TVFE of order 0.5 in conjunction with the mixed-order TVFE of order 1.5 in regions found 
adaptively (20% of the TVFEs are of order 1.5) for the coarse mesh. The adaptive refinement is 
carried out for each of the error indicators EIu EI 2 and Eh given by (l)-(3) on a sub-domain 
by sub-domain basis only with a sub-domain being three adjacent prisms extruded from a surface 
triangle (386 sub-domains for the coarse mesh). 

Real and imaginary parts of the input impedance as a function of frequency are given in Fig. 3 
for the first three TVFE options (no adaptivity). Instead of the strongly resonant behavior char¬ 
acterizing cavity-backed patch antennas, we observe the expected slightly oscillatory behavior of 
the real and imaginary parts. Mixed-order TVFEs of order 0.5 for the fine mesh and mixed-order 
TVFEs of order 1.5 for the coarse mesh give similar results that are better (real parts closer to 
300 and imaginary parts closer to 0S2) than those with mixed-order TVFEs of order 0.5 for the 
coarse mesh. Although mixed-order TVFEs of order 0.5 for the fine mesh and mixed-order TVFEs 
of order 1.5 for the coarse mesh give similar results, the latter approach is more attractive than the 
former in terms of memory and CPU time requirements. 
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Figure 3: Real and imaginary part of the in¬ 
put impedance for the metallic bowtie patch an¬ 
tenna in Fig. 1-2. 



Figure 4: Real and imaginary part of the in¬ 
put impedance for the metallic bowtie patch an¬ 
tenna in Fig. 1-2. 


To demonstrate the merits of adaptive refinement, Fig. 4 shows the real and imaginary parts 
of the input impedance as a function of frequency for the coarse mesh with mixed-order TVFEs of 
order 1.5 throughout the cavity and with 80% mixed-order TVFEs of order 0.5 and 20% mixed- 
order TVFEs of order 1.5 found via adaptive refinement using EI\, EI 2 and EI 3 on a sub-domain 
by sub-domain basis. The results are almost indistinguishable expressing that we can accurately 
predict the behavior of the antenna using only 20% mixed-order TVFEs of order 1.5. This presents 
a significant memory and CPU time improvement at virtually no cost. 

To illustrate the regions of refinement, we consider a cross-section of the mesh parallel to the 
antenna with the boundaries of the metallic cavity and antenna marked with thick black lines. As 
mentioned previously, the tetrahedral volume mesh is grown from a triangular surface mesh in a 
cut in the plane of the bowtie patches by extruding it into three prism layers and breaking each 
prism into three tetrahedra. Since we consider sub-domain by sub-domain refinement only, 0 or 9 
tetrahedra can be refined corresponding to a given triangle in the cross-section. In the following, 
a white / dark gray triangle indicates that 0/9 tetrahedra are being refined. The regions of 
refinement at 0.7 GHz (close to the center of the frequency band of operation) are shown in Fig. 5. 
Although they are seen to differ slightly for the three different error indicators, they all show the 
general trend of predicting the feed area and, to a lesser extent, the corners of the triangular patches 
as the regions where mixed-order TVFEs of order 1.5 are applied. 


4 Finite antenna array analysis 

Accurate and efficient analysis of large finite arrays is of significant interest for practical phased- 
array applications. A fast but approximate approach is to consider the array as infinite in order to 
reduce the complexity of the problem by studying only a unit cell of the periodic structure. This 
approach has been shown to be accurate and efficient for radar cross section (RCS) computation 
of finite cavity arrays when utilizing the hybrid FE/BI technique [6]. In [6], the scattering pattern 
for a finite array was obtained by multiplying the pattern for a single unit cell of the corresponding 
infinite array with the proper array factor. Alternatively, one can simply truncate the infinite array 
solution in the finite array region. This truncation is carried out by generating the finite array near 
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Figure 5: Regions of refinement for the metallic bowtie patch antenna in Fig. 1-2 with Eli (left), 
EI 2 (middle) and EI 3 (right). 


fields using an infinite array unit cell solution along with the periodic phase shift imposed by the 
Floquet mode series. As an application of this approach, we consider the radiation from slot antenna 
arrays (Fig. 6). Radiation patterns of 3 x 3 and 5x5 slot antenna arrays are given in Fig. 7 for 
the truncated infinite array solutions along with corresponding finite array solutions obtained via 
rigorous but inefficient finite array analysis (brute force simulation of all array elements). As seen, 
the truncated infinite array patterns agree reasonably well with the exact finite array patterns in 
this case. However, for composite structures involving complex materials, we expect the agreement 
between the exact finite array solution and the approximate one based on the infinite array to 
be less acceptable. Furthermore, we emphasize that neither of the two approximate approaches 
incorporate the finiteness of the array into the analysis. That is, edge effects are not taken into 
account. 

In the literature, there have been important attempts for accurate and efficient finite array 
analysis by taking advantage of the infinite array solution while including the edge effects. In [7], 
edge effects are taken into account through a convolution of the associated infinite array spectra 
with the finite array footprint. Poisson’s sum formula (PSF) for finite sums is employed to obtain 
an infinite series representation from a finite series one. However, the incomplete form of the PSF 
which does not include explicit end point contributions is used, as recently pointed out in [8]. As 
discussed in [8], it is important to consider the complete PSF which contains explicit end-point 
contributions in order to evaluate array surface currents accurately. To emphasize the use of the 
complete PSF for a two-dimensional finite array, the array factor of a 41 x 41 uniform planar array 
is considered in Fig. 8 for a 0.5Ao spacing and the scan direction ~ (30°, 45°). As seen in 

Fig. 8, the incomplete PSF results in error at angles far from the main beam and yields incorrect 
locations for pattern nulls and lobes. This is similar to the one-dimensional example given in [8]. 
Here, we employ the complete form of the PSF in our analysis which is presented in the following. 

In dealing with the computational burdens of finite arrays, we introduce a new approach for 
pattern computation. The approach is based on the FSDA formulation for infinite periodic arrays 
[3]. However, we instead employ a modified aperture Green’s function which is obtained by using the 
complete PSF to account for edge effects in an approximate manner. This modification influences 
the matrix-vector products within the iterative solver algorithm and has a minimal effect on the 
CPU time requirements for the iterative solver. As a result, the CPU time requirements for finite 
array analysis are comparable to those for infinite array analysis. Below, we briefly discuss the 
formulation of the proposed approach. 
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Figure 6: Slot antenna array geometry. 




Figure 8: Array factor of a 41x41 uniform planar array with an element spacing of 0.5Ao and the 
scan direction = (30°, 45°). 
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The FSDA [3] employs the spectral domain Green’s function to efficiently evaluate the BI 
matrix-vector products carried out within an iterative solver. Instead of explicitly generating all 
BI matrix elements, the actual current distribution is summed up in the spectral domain in each 
iteration and the spectral Floquet mode series is computed only once per weighting function. Thus, 
the FSDA results in a CPU time complexity of O(N) as well as a memory demand of 0(N) for a 
given fixed set of modes. Compared to the adaptive integral method (AIM) [9], the FSDA leads to 
enormous speed-ups of about two orders of magnitude for relatively small numbers of unknowns [3]. 
In the proposed approach, we take advantage of the speed and memory superiority of the FSDA. 


We begin with a truncated infinite array Green’s function representation. For a finite periodic 
array with ( 2 M x -f 1 ) x ( 2 N y +1) elements, the free space periodic Green’s function can be written 


M X Ny 

G p (r,r') = 4 V V —f— e -ikz m „\z-z'\ e -jk tmr ,-(p-p') ( 4 ) 

A m--M x n=-JV v 2j zmn 


where the notation of [10] is used. Using a procedure similar to that in [T], we then employ the 
complete PSF on the doubly finite series in (4) to obtain an infinite series representation convolved 
with a pulse function representing the footprint of the finite array. This convolution operation is 
easily carried out since it involves an impulse function. Considering the discretized version of the 
BI by applying Galerkin’s testing with the well-known Rao-Wilton-Glisson basis functions b g (r) 
[ 11 ], we end up with the representation 


(Mr), H(r)> = W + F ? + + n r ] >9 = 1.•••> N B i 


N B i 


k 0 A 


p =i 


(5) 


where E p represent Nbi electric field expansion coefficients and F? p (i = 1,2,3,4) are given by 
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E 
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E 

y ^ bg(kf mn ) * G^kfron) • bp(kf mn ) 

(9) 


m——M x n~±N y 


where * and ~ denote complex conjugation and spectral domain representation, respectively, and 
Ci are constants determined by the geometry of the unit cell. Also, G(k fTnn ) is the discrete version 
of the spectral domain half space dyadic Green’s function as given in [ 6 ]. The four terms F? v in 
(5) are due to the complete PSF and represent the finiteness of the array. That is, they take into 
account edge effects. They are readily implemented in the context of the FSDA as outlined in [3]. 
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We note that the computational domain for the finite array analysis is only a single unit cell (as 
in the case of an infinite array) and that we still consider the phase boundary conditions within 
the finite element volume [10). However, as compared to the approach of simply truncating the 
infinite array solution, the proposed methodology provides superior accuracy since it includes edge 
contributions. 

5 Conclusions 

This paper presented an adaptive multi-resolution approach for accurate and efficient modeling of 
finite antenna arrays. Its merits for single elements were demonstrated and its extension to finite 
arrays was discussed. 
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1. ABSTRACT 

This paper presents an efficient method to solve the problem of radiation from conformal aperture and 
microstrip antennas mounted on arbitrarily-shaped conducting bodies. The method, based on the surface 
equivalence and reciprocity principles, uses a combination of the Finite Difference Time Domain (FDTD) and 
Method of Moments (MoM) techniques to substantially improve the computational efficiency of the radiation 
pattern calculation. When the geometry and location of the radiating element are modified, only a small portion 
of the overall analysis requires re-simulation. This leads to a significant improvement in computational 
efficiency over presently-used techniques, and can substantially improve design efficiency when included in an 
optimization loop. The technique is first validated by solving two canonical problems, namely a thin slot which 
is oriented either axially or azimuthally on an infinitely long, perfectly conducting cylinder. These patterns are 
then compared to those produced by the same slots mounted on finite-length cylinders. Finally, patterns are 
computed for a cavity-backed elliptical patch antenna mounted on an infinite-length PEC cylinder and 
compared to patterns computed by an alternate method. 

2. INTRODUCTION 

Low profile conformal antennas for use on mobile communications systems are receiving widespread 
attention due to the advantages of minimized aerodynamic friction and radar cross-sectional area, reduced risk 
of antenna structural damage, and simplified construction requirements over conventional types. Since this type 
of antenna has a complex configuration and is usually mounted on a large structure, the simulation of these 
antennas continues to be a challenge. Methods that will accommodate complex structures such as FDTD and the 
Finite Element Method (FEM) require an excessive number of volume elements for large bodies. Surface 
methods such as the Method of Moments (MoM) that can model large structures are limited to relatively simple 
material compositions such as homogeneous or layered material structures. Thus, any of these methods used 
alone cannot efficiently model a typical conformal antenna; hence, the need for a combination of techniques. 
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Several combinations of simulation techniques have been used; the hybrid method used in this paper 
includes the MoM and FDTD. These methods were selected for their strengths in modeling large arbitrarily- 
shaped homogeneous conducting bodies and relatively small regions with a complex material structure, 
respectively. The link between the two methods is the reciprocity theorem, which relates the currents and fields 
between two systems [1-3]. 

The hybrid method used in this paper reduces computational requirements for the calculation of 
radiation patterns, especially when the shape of the radiating element and location on the mounting structure are 
modified. For most configurations, the size and shape of the mounting structure is fixed: the proposed method 
exploits this constraint by requiring analysis on this part of the problem only once. Instead of re-simulating the 
entire structure to obtain the aperture field distribution for every element shape and location change, this 
approach requires a re-simulation in the vicinity of the element only if the element shape is changed. The 
proposed hybrid method eliminates the need to carry out multiple computational ly-intensive matrix operations 
encountered in the use of other simulation methods. 

Since the FDTD technique is well-suited to the analysis of complex inhomogeneous structures, it is 
used to model the region which includes the cavity-backed conformal antenna and a sufficient volume around it 
to account for the effects of the surrounding environment on the radiating aperture field distribution. Through 
the surface equivalence principle, equivalent currents are found and the cavity region is closed with a perfect 
electrical conductor (PEC). The resulting structure, which can be large relative to the radiating aperture and of 
arbitraiy shape, is analyzed by the MoM technique, since this method approximates the surface with triangular 
element discretization. This enhances both computational accuracy and efficiency via the use of a closely-fitting 
surface mesh of the PEC object, as opposed to using a volume discretization, which would be required in the 
FDTD method. 

3. APPROACH 

This paper utilizes the reciprocity principle to divide the original problem into two parts [1-3]. A 
reciprocity approach is also applied in [4]; however, the Finite Element Method (FEM) is used to compute the 
equivalent magnetic currents on surface S! instead of the Finite Difference Time Domain (FDTD) technique. 
First, we analyze the region containing the conformal antenna, which is typically inhomogeneous. The FDTD 
method is used for this analysis to derive equivalent electric and magnetic currents J and M on the surface Si 
(see Fig. 1) of the radiating aperture of the antenna. Next, Si is backed by a perfect electrical conductor (PEC) 
to short out the electric currents and the problem reduces to that of computing the radiation from these magnetic 
currents located on the closed PEC body. This calculation takes into account the shape of the PEC body in the 
vicinity of the antenna. 
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The second step entails the application of the reciprocity principle to address the pattern computation 
problem. By invoking the reciprocity principle, we can write 

Je, • J 2 dV = -Jh 2 -M, dV (1) 

V V 

Our objective is to compute Ei radiated by Mi at a far-zone observation point P. We proceed by solving for the 
surface electric currents flowing on the closed PEC body by illuminating it with an infinitesimal dipole source 
J 2 placed at location P. This problem is well-suited for handling by the Method of Moments (MoM). From the 
electric currents, the equivalent surface magnetic fields H 2 are determined and by performing the integration on 
the right-hand side of (1), the desired field Ej is computed. 

Note that once H 2 is determined for the entire mounting structure, only Mi requires recalculation if the 
radiating element changes shape or location. The volume integral on the right-hand side of (1) reduces to the 
surface area of the superstate (see Fig. 1). This re-simulation requires only a small percentage of the time 
required for a full-wave analysis of the entire mounting structure. 


4. NUMERICAL RESULTS 

To illustrate the procedure, we first compute the radiation pattern produced by a thin slot mounted on an 
infinite-length PEC cylinder. The slots are oriented in two directions: along the cylinder’s axis (axial) and in the 
circumferential (azimuthal) direction. For the axial orientation, the aperture fields in the slot are ^-polarized 
(across the narrow width of the slot); thus, the |E$| pattern was computed. For the reciprocity calculation, the far- 
zone dipole source must be polarized in the same direction; therefore, analytical formulas taken from [5] to 
determine surface currents generated by a TE Z plane wave were used to obtain the necessary H fields (from the 
surface electric currents). For the azimuthal slot, the polarization of the aperture field is in the z-direction; thus, 
surface currents generated by a TM Z plane wave were used in the reciprocity pattern computation for |E e |. The 
slot field variation was assumed to be cosinusoidal (dominant mode) and uniform along the length and width of 
the slot, respectively, for both orientations. 

The reciprocity-approach patterns are compared to analytical formulations for these configurations [5] 
and are found to be in exact agreement (see Figs. 2 and 3 below). 
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|EJ vs. <J» Pattern (Direct) for Axial Slot in |E + | vs. <{> Pattern (H*M) for Axial Slot in 

PEC Cylinder (Radius=0.2X, Length=oo) PEC Cylinder (Radius=0.2/*, Length=co) 


Fig. 2 Radiation Patterns: Axial Slot on Infinite-Length PEC Cylinder: 
Computation by Analytical (Direct) vs. Reciprocity (H*M) Methods 




|Eel vs. 4> Pattern (Direct) for Azimuthal Slot (24.8°) |E e | vs. <j> Pattern (H*M) for Azimuthal Slot (24.8°) 

in PEC Cylinder (Radius =0.2A., Lengths) in PEC Cylinder (Radius=0.2/t, Length=oo) 


Fig. 3 Radiation Patterns: Azimuthal Slot on Infinite-Length PEC Cylinder: 
Computation by Analytical (Direct) vs. Reciprocity (H*M) Methods 


The above patterns are computed for slots mounted on a 0.2X radius PEC cylinder. The subtended angle for the 
azimuthal slot is 24.8°, which is identical to the one in the example considered in [5]. 

Simulations were then carried out on finite-length cylinders. Since analytical formulas do not exist for 
surface currents generated by plane waves incident on finite-length cylinders, a numerical code (MoM) was 
used to obtain these currents. For the axial slot, the polarization of the slot aperture and therefore the far-zone 
dipole source are <j>-directed. Hence, the truncation in the z-direction has a minimal effect on the pattern. This 
can be seen in the figure below which plots the patterns produced by axial slots mounted on IX, 2X, and, infinite 
length PEC cylinders. 
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Fig. 4 |E*j vs. <f> Radiation Patterns: Axial Slot on Finite-Length PEC Cylinders 

The solid line represents the pattern for an axial slot mounted on an infinite cylinder. Thus, for a 2X- 
length cylinder, the pattern has essentially converged to the infinite case. The pattern for the l>.-length cylinder 
shows a higher back lobe level and deeper nulls caused by an enhanced standing wave presence created by the 
decreased distance to the truncated edge. 

For the azimuthal slot, since the length of the cylinders had a significant impact on the shape of the 
patterns - an odd versus an even number of wavelengths - they are presented separately (see Fig. 5 below). 




[E e | V5. c[»: Odd ^-Length Cylinder |E e | vs. <(>: Even X-Length Cylinder 

Fig. 5 Radiation Patterns: Azimuthal Slot on Finite-Length PEC Cylinders 

Again, the solid line shows the pattern produced by an azimuthal slot mounted on an infinite-length cylinder. 
The large back lobes for the even wavelength case are due to the current pattern formed on the finite cylinder 
when illuminated by a z-polarized plane wave. Since the truncation direction (z) is in the same direction as the 
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polarization of the plane wave, standing waves are formed with the number of peaks equaling the number of 
wavelengths. For the even wavelength case, the <f>-cut at the z-midpoint passes through a local minimum in the 
current pattern on the lit (incident) side and a small local maximum on the shadow side of the cylinder. It is this 
shadow side peak that causes the large back lobe which appears in the patterns for the even case. For the odd 
case, the 4>-cut at the z-midpoint passes through a local maximum on the lit side and a local minimum on the 
shadow side. This results in a smaller back lobe. 

While the patterns appear to converge to the infinite case more closely for the odd wavelength case, 
both cases tend toward the infinite case as the length of the cylinder increases. The standing wave effects 
diminish only after the cylinder has reached a substantial length (beyond what could be simulated here), at 
which point both the odd and even wavelength cases begin converging to the infinite length case. 

Finally, the radiation pattern produced by a cavity-backed elliptical patch antenna covered by a 
superstrate and mounted on an infinite-length PEC cylinder was computed. The radius of the cylinder is 0.63 \X 0 
(0.5?to + 0.2Xd). The patch considered for this analysis is elliptical with major and minor 
0.2Xd, respectively, and was placed inside a rectangular block of dielectric (e r = 2.33) 
direction) and width 0.8^ (y’-direction) with a thickness of 0.2A,d. The major axis of the 
with the cylinder axis. A diagram of the conformal antenna is shown in Fig. 6 below. 



Fig. 6 Elliptical Patch Antenna Mounted on Infinite PEC Cylinder 

The current distribution on the surface of the PEC cylinder is first calculated by analytical methods [5], 
followed by a computation of the electric fields (computed by a locally conformal FDTD code [6]) on the 
surface of the superstrate. The patch antenna is linearly polarized along the cylinder z-axis (patch x’-axis), 
hence the |Eq| field (0 = 90°) pattern is computed. This requires the computation of the surface electric currents 
produced on the cylinder by a z-polarized plane wave, which produces only z-directed currents. Therefore, only 
the x’-polarized aperture fields on the superstrate surface contribute to this pattern. 


axes of 0.25^ d and 
of length R d (x J - 
ellipse was aligned 


747 







The resulting patterns were constructed by summing the complex dot-product of the electric current and 
electric field for each point on the surface of the superstate and repeating this calculation while rotating the 
array of surface currents in the (^-direction. This dot-product is effectively the same as H 2 ♦ Mj which appears on 
the right-hand side of (1), due to the normal vector being common to both vector quantities. The resulting 
pattern (H*M in Fig. 7 below) is compared to another obtained via the reciprocity theorem applied in a different 
manner. In this second approach, an infinite-length dielectric coated cylinder is considered, for which analytical 
formulas for the electric field inside the dielectric produced by an incident plane wave have been derived. The 
electric currents generated on the patch due to the feed (placed on the x’-axis halfway between the center and 
left edge) were computed using an MoM code. The MoM formulation assumes that the dielectric and backing 
ground plane are infinite in extent. These currents are then combined with the electric fields inside the dielectric 
at the location of the patch, and integrated over the surface of the patch to produce the radiation pattern. This 
plot is denoted E*J in the figure below. 




|E©| vs. H*M Approach |E 0 | vs. 4>: E*J Approach 

Fig, 7 |EJ Radiation Patterns: H*M vs. E*J Reciprocity Approach 

The two patterns are seen to compare well. The H*M approach pattern has a slightly higher back lobe 
level, most likely due to the truncation of the radiating aperture area. If the aperture area is increased, the 
relative energy transmitted in the forward direction will increase resulting in a reduced back lobe. Since the 
patterns compare well, the selected aperture size is considered sufficient. 

5. CONCLUSIONS 

This paper has presented an efficient method for determining radiation patterns for arbitrarily-shaped 
conformal antennas. Several examples were presented illustrating the procedure for computing a pattern using 
the reciprocity approach. The technique was first verified against known radiation patterns, then extended to 
more complex structures. The pattern produced by a cavity-backed elliptical patch antenna was also computed 
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and compared to a pattern constructed using an alternate method. The patterns were shown to be in good 
agreement. 

The paper shows how results obtained for the mounting structure can be used for any radiating element 
shape and location on the structure in the computation of the radiation pattern. This feature of the technique 
results in significant improvements in computational efficiency when the shape or location of the radiating 
element is varied. Further work is being pursued on curved radiating patches using an extension of the present 
locally-conformal FDTD algorithm for the magnetic current calculation. The effect of curvature on the flat 
patch approximation to pattern computation will be addressed and quantified in terms of finding the limit when 
the approximation is no longer accurate. 
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1. ABSTRACT 

A radio navigation system on a small aircraft was unlocking on approach to a number of Australian 
military bases. Flight trials to establish the cause of the problem proved inconclusive. Air Operations 
Division (AOD) of the Defence Science and Technology Organisation (DSTO) was asked to 
investigate the problem, to review the flight trials, and to resolve the issue. 

An approach was developed to integrate Geometrical Theory of Diffraction (GTD) modelling of the 
electromagnetics problem, using a simplified flight model, and a Monte-Carlo simulation, to allow an 
informed decision to be made on the antenna location based on operational needs. The paper describes 
the process undertaken, the results of the modelling process, the Monte-Carlo simulation results and 
the measure of performance chosen for this particular problem. The paper also describes how this 
approach may be applied to other circumstances. 

2 . INTRODUCTION 

Finding the optimum location of antennas on aircraft is notoriously difficult, especially so on military 
aircraft. Location decisions are often dominated by engineering or aerodynamic constraints, partly out 
of need, but also by default, because of difficulty in assessing the operational implications of a 
particular location. Excellence in radio navigation system performance, based on operational 
effectiveness, is all an aircraft operator really cares about. Optimum antenna location is a small but 
critical part of good system performance. 

Computational methods for analysis of antenna/aircraft interaction are available but they are large 
consumers of processing time. However, recent improvements in the performance/cost ratio of 
computers have made feasible the proper analysis of the likely behaviour of an antenna on a given 
aircraft. 

Aircraft/antenna interaction is commonly quite complex, mainly due to the size and shape of the 
aircraft compared with the wavelength of the transmission. Complex interaction produces complex 
radiation patterns, hence the conventional radiation plots are completely inadequate. The use of 
radiation patterns in two or three dimensions provides considerably more insight, however their 
interpretation is still difficult and the operational impact of the interaction is not revealed. 
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To investigate the performance of a radio-navigational (RN) system, two-dimensional radiation 
patterns were calculated for two possible antenna locations on a small single-engine aircraft, at a 
frequency of 1000 MHz. It was found that the patterns are of limited value in selecting the correct 
location, due to difficulties in interpretation and the unknown impact of the location on operational 
performance. This paper presents a solution to this problem by the integration of a computed antenna 
pattern with typical approach profiles in a Monte Carlo simulation. This technique gives a measure of 
performance of the system in flight to allow a choice based on operational effectiveness. Using this 
technique, the efficacy of various locations can be compared by analysing the antenna performance 
over thousands of simulated RN instrument approaches, providing more comprehensive information 
at considerably lower cost than actual flights. 

In time, it is expected that this approach will aid in the locating of antennas during aircraft design, and 
also will provide an alternative method of presenting this complex information to aircraft operators, 
pilots and military mission planners. For the specialist in Computational Electromagnetics (CEM), 
this paper emphasizes the need to present the results of computation in understandable and computer 
effective forms. 

3. MODELLING RADIATION PATTERNS OF ANTENNA 

3.1. Computational Electromagnetics Codes 

There are many computational techniques that can be employed in solving electromagnetic problems. 
(See [1] for a review). For antenna radiation pattern analysis, the most important techniques are the 
Method of Moments (MOM) and the Geometrical Theory of Diffraction (GTD). The MOM technique 
[2] involves the creation of a model of the structure under investigation with short wire segments. A 
matrix solution of the impedance equation allows the current in each segment, and hence by 
summation, the total electromagnetic (EM) field to be calculated. Unfortunately, this technique is not 
suitable for analysis of electrically large structures (i.e. large compared with wavelength), because the 
number of segments and hence computing time required for solution grows rapidly. For a moderately 
complicated structure of 1500 wire elements, the frequency limit is approximately 300 MHz. This 
limit is remarkably independent of computing power, since solution time varies as the fourth power of 
frequency when creating three-dimensional models. 

The second technique, GTD, is a ray-tracing method with its roots in Physical Optics [3]. It is only 
suitable for electrically large structures, and is the technique used in this paper. The structure under 
investigation is represented by a series of basic geometrical shapes ie flat plates and cylinders. It is 
illuminated by EM waves and the reflected and diffracted radiation at any point can be found and 
summed to find the total field. 

The software code GEMACS (Generalised Electromagnetic code for the Analysis of Complex 
Systems) implements both MOM and GTD, together with other techniques [4][5]. GEMACS has been 
used for the calculation of the antenna patterns in this paper. 
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3.2. GTD model of aircraft 

A GTD model of the front underside of a typical single-engine propeller-driven aircraft was created 
using 40 flat plates, and 9 cylinders. The model is shown in Figure 1. The model has been reduced to 
the minimum number of plates and cylinders as this is the most effective method for controlling 
computational speed without sacrificing antenna pattern accuracy. We chose to model the type of 
aircraft which has a cavity and doors associated with the front nose wheel, but whose main 
undercarriage doors retract flush with the wing surface. This was to best represent a typical high speed 
single engine aircraft. 



Figure 1: GTD model of a small single engine propeller 
aircraft, and the locations of the two antennas (A & B). 
The antenna location A is slightly off centre and behind 
the line connecting the main wheels. The antenna 
location B is further towards the rear of the model and 
along the centreline of the fuselage. 



Figure 2: The original location 2D antenna pattern 
looking toward the aircraft with the undercarriage down. 
Dark areas are 60dB down on the light areas. 


3.3. A quick look 


The first attempt at looking at this problem was to use the model to evaluate the field that could be 
seen if you were to look directly at the front of the aircraft. While the RN system is primarily a 
receiver, the principle of reciprocity has been used to simplify the computational workload. The 
results of this first attempt are shown in figure 2. Confidence in these results was obtained by a 
process of modeling the aircraft without undercarriage, with nose-wheel but no bay doors and with 
bay doors open but no nose-wheel. The additional modeling was done to assess the sensitivity of the 
model and the modeling code to model features. The results of figure 2 clearly show that small 
changes in the aircraft attitude have the potential to produce large changes in the received signal 
strength. Attitude changes in the order of one degree can change the received signal by approximately 
50 dB. This evidence was sufficient to indicate why the flight trials had been inconclusive and 
demonstrated the need for a more thorough investigation of the problem. These results also 
demonstrated the need to move the antenna to a better location. 


3.4. Radiation patterns for two antenna locations 

Spherical sources of radiation emitting at 1000 MHz (using a GEMACS command) were located at 
each of two possible antenna sites, and the far field radiation patterns were calculated every degree for 
a range of angles: -120 to 120 degrees in azimuth, and -20 to 20 degrees in elevation (relative to the 
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horizon). Each radiation pattern took approximately 100 hours of computing time on a fast 
workstation (SUN Ultra 2, rated at 505 SPECfp92). 

The radiation patterns obtained are shown in figure 3 as greyscale plots. These patterns are complex, 
but provide abundant information more readily than a series of traditional polar diagram plots. 

The two patterns are very different. Pattern B is symmetrical, as might be expected from the central 
location of the corresponding antenna. Likewise, pattern A is decidedly asymmetric. Pattern A shows 
more high elevation absorption than B, as might be expected from its location in line with wings, 
therefore performance in turns will be poorer. Pattern B only shows high elevation absorption in the 
forward direction, when the fuselage is in the line of sight of the transmitter. Performance in turns for 
location B should be better. In addition, both patterns show attenuation due to the main landing 
wheels [for A at (-70,-18) and (65,-18) and B at (±40,-18)] and the front nose wheel [for A at (0,-12) 
and Bat (0,-9)]. 

Other than making gross observations like the above, it is difficult to see how antenna performance 
differences between the two locations will affect subsequent navigational system performance. To try 
and extract more useful information from antenna patterns, Monte Carlo simulations of typical 
instrument approaches were performed, and coupled with the radiation patterns to obtain predictions 
of operational, in-flight antenna performance. 



Figure 3: Greyscale antenna radiation plots, for the two antenna locations (a) A and (b) B, showing attenuation of the 
antenna signal over a wide area. (0,0) is straight ahead, positive elevations are above the nose, and positive azimuths are to 
starboard (right). The scale represents a range of -60 dB, from light OdB to dark -60dB. 

4. SIMULATION OF ANTENNA PERFORMANCE 

4.1. An example instrumented approach 

An approximate flight path, typical of RN instrument approach, with representative random 
perturbations superimposed was modelled. No attempt has been made to accurately simulate an 
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aircraft approach, nor have we utilised an accurate flight model or wind models to calculate the 
attitude of the aircraft with high-fidelity. 



Figure 4: Schematic representation of an example instrument approach (full line) and nominal approach (dashed). The 
vertical scale is multiplied by a factor of 10. 


The nominal flight path (shown dashed in Fig 4), follows an arc of radius 20 km at not less than 3000 
feet (900 metres) altitude. The aircraft then turns towards the beacon a few degrees in advance of the 
heading of the runway, A 3 degree glideslope is followed, so that by the Missed Approach Point 
(MAP), about 2 km out, the aircraft is above 500 feet (170 metres) altitude. At this point the pilot 
normally decides whether to proceed with the landing, in which case he aligns with the runway, 
otherwise the landing is aborted. The flight path beyond the MAP has not been considered, since after 
this point the RN system has lower operational importance until the next approach. 

We have used an algorithm to make the aircraft follow the nominal flight path, allowing random 
deviations of the order of a hundred metres laterally and in height. Simple feedback rules provide an 
additional level of realism: the aircraft is encouraged to return to the flight path more strongly when 
below the nominal height than when above it; and stricter adherence to the nominal flight path is 
prescribed when the aircraft gets closer to the beacon. A simulated approach is given in Fig. 4. 

The aircraft’s position is sampled every 100m, which is appropriate given the resolution of the 
antenna radiation patterns. For each position, the aircraft’s attitude is initially oriented as follows: the 
heading is made to coincide with the track the aircraft is following; the wings are made level; and the 
pitch is set to zero. On top of this are then superimposed fluctuations in roll, yaw and pitch, which are 
deviates chosen from normal (Gaussian) distributions of standard deviations of 2.5, 2.5 and 5 degrees 
respectively. These values represent realistic perturbations for a single engined aircraft in a realistic 
atmospheric environment. Once the attitude of the aircraft is fixed, the apparent location of the beacon 
is calculated in the coordinate frame of the aircraft, and finally the signal attenuation is calculated 
from databases representing the radiation patterns given earlier, together with an inverse-square 
correction. 

These simplifying assumptions reduce the fidelity of the simulation of the flight but result in a 
computationally fast program with sufficient fidelity. In this way, demonstration of the technique and 
what it has to offer in terms of predicting antenna performance can be achieved. 
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For a given flight, the antenna signal strength as a function of time, or in this case total distance 
travelled can be computed. Two such plots are given in figure 5 for the two antennas. 


Antenna Location A Antenna Location B 




(a) (b) 

Figure 5: Plot of antenna signal strength as a function of distance flown by aircraft for antenna in location (a) A and (b) B. 

The shape of the plots are similar: a level of OdB is maintained for the first 20km, before steadily 
increasing towards +40dB for the next 20km. This corresponds to the arc phase and final glideslope 
respectively. The antenna in position A shows a higher number of large signal drops when following 
the arc. This is consistent with the expectations described above because in this position the aircraft 
has the beacon located at +90 degrees in relative azimuth, and the drops occur when the aircraft rolls 
more than a few degrees towards the beacon, shielding the antenna with the wing. The antenna in 
position B shows a higher number of moderate signal drops when following the glideslope. This is 
consistent with the dark areas of figure 2(b) for nose down pitch attitudes. 

Plots such as those in figure 5 still give very little insight into the correct place to locate the antenna, 
because the operational aspects of the system were not included in the analysis. Both locations have a 
modest number of large signal drops but at different phases of the flight. Further processing of the 
Monte-Carlo results is necessary. 

4.2. Measuring Performance 

If the simulated approach is repeated many times, we are able to derive statistical information or 
average behaviour of the antennas and hopefully the attached RN systems. The statistical information 
will only have meaning if it describes some measure of the performance of the system. In this case we 
suspected that any automatic gain control in the RN systems would have considerable difficulty in 
coping with the large, rapid fluctuations in the received signal. The measure of performance was 
chosen to be the lowest probability of large fluctuations in the received signal during a flight. 

We performed 1000 approaches, to obtain a series of data like the examples shown above. Signal 
fluctuation is defined as the magnitude of the difference in signal obtained between two consecutive 
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aircraft positions separated by the 100m sample distance, ie the rate of change of the signal with 
respect to distance (hence time). In figure 6, we plot the cumulative number of fluctuations as a 
function of the fluctuation size, expressed as a fraction of the total number of fluctuations (N). 

If the RN system was able to cope with fluctuations up to 30 dB within the time taken to fly 100m, 
then an antenna location that produced the lowest probability of fluctuation at and above 30 dB would 
give the best system performance. For this case, fluctuations greater than 30 dB are seen 3% of the 
time for antenna location A, and only 0.3% for location B. Therefore if the RN system has 
performance problems with 30 dB signal variations in short distances (times), the best antenna 
location is B. In general, the antenna in location A shows more fluctuations of all sizes than location 
B, as curve A is always above curve B. From this analysis it is expected that B would be the preferred 
location for such an antenna. 



Figure 6: Cumulative distribution of signal fluctuations for antenna locations A & B. The fluctuation is the change in 
signal from sampled location of the aircraft to the next sample (100 metres apart). 


Another example of the use for this method is to answer the question “How many times does the 
system drop out due to a loss of signal?” For this case the data used for plots of figure 5 are used 
directly and the cumulative number of signal drops below a receiver threshold are compared over a 
significant number of flights. This gives a different measure of the influence of antenna position on 
antenna performance and system performance, and may lead to a different location considered to be 
better. 

4.3. Other Factors 

Conventional computational approaches to examine the coupling between the chosen location and 
other antenna systems also needs to be completed. The optimum place for this in the smart antenna 
siting process is highly situation dependent. In this case, the aircraft did not have any other systems in 
the RN band so the choice of location was not complicated. 
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In all cases of this modeling approach it is necessary to examine the sensitivity of the modeling to 
model error and the plethora of simplifying assumptions. In most cases this can only be done by 
changing model parameters and repeating the process. The single factor most likely to produce a poor 
result for this method is the choice of performance measure. 


4.4. Proof of the concept 

This work was completed for a real aircraft problem. The modification suggested by this work was 
trialed in a series of flights, along with other possible solutions. The antenna location that performed 
the best was position B as expected. This modification has now been included in the entire fleet. 

5. CONCLUSIONS 

It is possible with modem CEM codes and reasonable computing power to obtain high frequency 
radiation patterns of antennas located on a relatively complex structure. Two-dimensional plots in 
themselves represent an advance in presenting such data over a series of traditional polar diagrams, 
but are not sufficient for system level decision making. 

In addition, with a Monte Carlo simulation of a typical RN aircraft instrument approach, useful 
operational information about the performance of the antenna and its system can be obtained. This 
presents both a useful tool for choosing the initial location of the antenna, and an improved way of 
representing installed antenna performance data to the aircraft operators. The method presented here 
can also be used to predict installed antenna performance and justify changes to the position of 
antennas in poorly performing systems based on sound quantitative analysis. In general the method is 
not limited to aircraft navigation systems, it can be applied to a large variety of communication and 
surveillance problems. 
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Abstract 

Conformal antennas are an essential part in future communication and sensor systems as the 
integration of antennas in curved surfaces is requested more and more. Most of these conformal 
antennas have to work in a phased array manner using different excitation amplitudes and 
phases for each array element. The synthesis of this excitation coefficients requires exact 
knowledge about the coupling within the array, especially when small beamwidth or low side 
lobe levels are desired. 

Aperture coupled feeding technique is an excellent choice if broadband patch antennas are 
needed. However, up to now, no data about the coupling behaviour of these aperture coupled 
antennas were published. 

This paper presents measured scattering parameters for four combinations of polarization and 
position. The different coupling phenomena encountered during the measurements are identified 
and rated according to their influence on the excitation of the single elements. Finally some 
conclusions for the design of this type of antenna are drawn. 

Introduction 

Future broadband communication and sensor systems require efficient low profile antennas 
over a reasonable frequency bandwidth, which need to be integrated in existing curved surfaces 
either to hide the antenna due to aesthetically reasons or to minimize drag for aerodynamically 
reasons or both. Besides the conformal fit to the given surface the antenna should offer a variety 
of beamforming and beamsteering capabilities some of which are difficult to obtain with fixed 
planar antennas. 

A good approach to this variety of requirements is a microstrip patch antenna with a aperture 
coupled feeding, as depicted in fig. 1, to achieve broadband behaviour. As most of the surfaces 
show primarily a curvature in one direction, e.g. a wing of a plane, naturally a cylindrical setup 
is the first one to be studied. 

For a phased array operation of this conformal antenna the knowledge of the coupling in the 
array is mandatory if narrow beams or low side lobes are desired. In this contribution the 
coupling within an aperture coupled microstrip patch antenna on a cylindrical surface is studied 
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for horizontal ((^-directed) and vertical (z-directed) 
polarization with different element spacing in 
horizontal and vertical direction. 

Four geometry types 

In planar microstrip patch phased arrays the 
coupling is distinguished in E- and H-plane 
coupling, respectively. This is simply due to the 
fact that from the 2 parameters polarization and 
geometry, which determine the coupling, the 
geometry is invariant to rotations around a axis 
perpendicular to the array and therefore only the 
polarization is left as parameter. 

However, in cylindrical arrays the geometry as 
well as the polarization have to be taken into 
account to determine the coupling. The 4 principal 
arrangements are depicted in 
fig. 2. Fig. 2a shows a E- 
plane coupling along the 
curvature, fig. 2d shows the 
E-plane coupling along the 
not curved direction. The H- 
plane coupling along the 
curvature is depicted in 
fig. 2c whereas the H-plane 
coupling along the non- 
curved surface is shown in 
fig. 2b. 

These four different cases 
have to be treated separately. 

Additionally, the position of 
the slot relative to the 
cylinder's height is an 
important parameter, as the 
measurements show. 


Measured coupling 

To study the coupling effects 
for the different geometries 
according to fig. 2 under 
practical conditions, 4 Fig 2: 
different sets of 2 element 
arrays have been built and 
characterized. Fig. 3 shows 
the 5 measured setups for the 



Fig. 1: 


3D-view of an aperture coupled 
patch on a cylindrical surface 





The 4 different arrangements of geometry and polarization: a) 
<f>-polarization with <f>-directed element spacing, b) (j)- 
polarization with z-directed element spacing, c) z-polarization 
with <p-directed element spacing, d) z-polarization with z- 
directed element spacing 
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(j)-polarized 2x1 array with 
different element distances in 
^-direction. The realized 
element distances in <J>- 
direction are 0.4 Xo (69.2°), 
0.5 o (86.5°), 0.7 Xo 
(121.2°), 0.9 X 0 (155.8°) and 
1.04 Xo (180°) where the 
distance is measured between 
the center of the elements 
along the circumference in the 
"patch plane". 

The 6 measured setups for the 
z-polarized 1x2 array with 
different element distances in 
z-direction are depicted in 
fig. 4. The realized element 
distances in z-direction are 
0.35 X 0 , OAXq, 0.5 X 0 , 
0.7 Xo, 0.9 Xo and 1.04 Xo 
where the distance is measured 
along the z-direction between 
the center of the elements. 

The geometry of the 
cylindrical structure was the 
same for all the setups and 
consists of a central metallic 
cylinder (r = 7.5 mm), a 
foam spacer of 22.5 mm, a 
RT/Duroid 5880 feeding 
substrate (0.508 mm) and a 
10 mm foam layer as patch 
substrate. The patches 
themselves are etched on a 
35 Jim kapton-foil and glued 
to the foam material. The 
height of the cylinder was 
h= 180 mm. As the design 
frequency is 2.45 GHz all 
distances are normalized to 
that particular wavelength. 



Fig . 3: Aperture coupled antenna structures for the measurement of 

the coupling in <fhdirection in a ^-polarized array according 
to fig-2a. The carrying structure in the middle shows a single 
element and is surrounded by the bent feed network substrate 
(the slots can be seen very clearly) and the patches on their 
foam substrate . The realized distances are (from the left to 
the right) 1.04 Xq, 0.9 Xq, 0.7 Xq, 05 Xo and 0.4 Xq. 



Fig. 4: Aperture coupled antenna structures for the measurement of 
the coupling in z-direction in a z-polarized array according to 
fig. 2d. The carrying structure in the middle shows a single 
element and is surrounded by the bent feed network substrate 
(the slots can be seen very clearly) and the patches on their 
foam substrate. The realized distances are (from the left to 
the right) 0.35 Xq, 0.4 Xq, 0.5 Xq, 0.7 Xq, 0.9 Xq and 1.04 Xq. 
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Input reflection coefficient 

As known from planar arrays coupling 
can have severe influence on the input 
reflection coefficient of the array 
elements [1]. That also happens to the 
cylindrical array. The measured input 
reflection coefficient for the <[>- 
polarized elements as a function of the 
E-plane distance around the 
circumference is plotted in fig. 5. In 
the frequency range between 2 GHz 
and 2.5 GHz the same behaviour as in 
planar arrays occurs, coupling shifts 
the peak of the return loss towards 
higher frequencies. Between 2.5 GHz 
and 2.85 GHz, however, this 
phenomenon is dominated by the 
coupling of the slots to the TEj \ i 
resonance of the multilayer coaxial 
cavity inside the antenna. Furthermore 
the circumference at that frequency 
reaches a value of 2 wavelengths so an 
additional resonance in the patch 
substrate occurs. Above 2.85 GHz the 
input reflection coefficient again 
behaves like in a planar array with the 
exception of the resonance at 
3.5 GHz, which is the TEi 14 
resonance of the coaxial cavity coupled 
to primarily by the microstrip feed 
lines. 

Fig. 6 shows the influence of H-pIane 
coupling on the return loss along the 
planar surface of the cylinder. As in the 
planar case the input reflection 
coefficient is less affected by H-plane 
coupling what leads to nearly identical 
curves below 2.4 GHz. The coupling 
to the TEin resonance at 2.6 GHz is 
weaker than in the case of fig. 5. This 
is due to the fact that the slots are 
moved symmetrically upwards and 
downwards (see fig. 4), what reduces 
the coupling of the slots to this mode. 
But the same movement increases the 



Frequency / GHz 

Fig. 5: Return loss of a (p-polarized element in a 2x1 
element array for (p-directed element spacing of 
0.4 Xq, 0.7 Xq and 1.04 Xq. 



Fig. 6: Return loss of a (p-polarized element in a 1x2 
element array for z-directed element spacing of 
0.4 Xq, 0.7 Xq and 1.04 Xq. 



Fig . 7: Return loss of a z-polarized element in a 2x1 

element array for (p-directed element spacing of 
0.4 Xq, 0.7 Xq and 1.04 Xq. 
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coupling to the TEM 4 resonance at 
3 GHz. Also here the TEn 4 resonance 

3.5 GHz occurs, but this time the slots 
are more involved in the coupling to 
this mode what leads to a higher 
distance dependence in the form of the 
resonance peak. 

The return loss of fig. 7 again is a 
function of H-plane coupling, which, 
like in the case of fig. 6 , does not 
influence much the input reflection 
coefficient. For a distance of 0.4 
the peak of the return loss tends to shift 
towards higher frequencies. The peak 
at 3 GHz is caused again by weak 
coupling to the TEM 4 resonance by the 
feed lines. The feed lines also cause a 
coupling to the TE 113 resonance at 
3.25 GHz. 

Fig. 8 shows an input reflection 
coefficient influenced by E-plane 
coupling along the planar z-direction. 
The frequency shift of the input 
reflection coefficient can be estimated 
by the form of the curves above 

2.6 GHz. At the design frequency 
however, the coupling to the TEM 3 
resonance, which strongly affects the 
0.4 Xo distance, hides any other 
effects. The peaks around 3 GHz, 
3.25 GHz and 3.5 GHz again mark 
the coupling to the TEM 4 , TE 113 and 
TEi i 4 resonance. 

Coupling between elements 

The coupling in the E-plane around the 
circumference of the cylinder is plotted 
in fig. 9. Generally the coupling is 
quite high reaching values above 
-5 dB. The coupling generally drops 
with increasing distance slower than in 
the H-plane, what is well known from 
planar arrays. The sharp peak around 

2.6 GHz is due to the TEj 1 1 
resonance which influences the 
coupling according to the location of 
the feed lines and slots. This geometry 



Frequency / GHz 


Fig. 8: Return loss of a z-polarized element in a 1x2 
element array for z-directed element spacing of 
0.4 Xq, 0.7 Xq and 1.04 Xq. 



Fig. 9: Coupling between 0 -polarized elements in a 2x1 
element array for <p-directed element spacing of 
0.4 Xq, 0.7 Xq and 1.04 Xq. 



Fig. 10: Coupling between (p-polarized elements in a 1x2 
element array for z-directed element spacing of 
0.4 Xq, 0.7 Xq and 1.04 Xq. 
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dependence is even more visible 0 . 

around 3.5 GHz at the TEi 14 _| | ] : 

resonance. Due to the field j I 

configuration of the TEji mode the r ]""* 71 | 

coupling for the 180° (1.04 Xq) .|. 1 

separation (-4.3 dB) is higher than for f ' 20 ~r ....L.j.: 

the 0.7 Xo (121.2°) separation s -25 -.f..V—^ f\- - 

(-15 dB). -30 i|—-Atef. j .V- ~°‘ 40X ° I 

Fig. 10 shows H-plane coupling along . 35 .4. ~ 

the planar z-direction. The coupling is , , yfeWrVl /?! . * . j 

generally lower than in the case of 2 2,5 3 3.5 4 

fig. 9 but higher than in fig. 10. The Frequency /ghz 

resonances again disturb the global Fig. 11 ; Coupling between z-polarized elements in a 2x1 

behaviour of the curves. This time the element array for <p-directed element spacing of 

T E 111 resonance decreases the 0.4 Xq, 0.7Xo and 1.04 Xq. 

coupling as the elements are moved 0 _ 

symmetrically upwards and ! i 

downwards to establish the desired ^ ^ ^^ ji 

separation (see fig. 4). The TE 112 10 f y jp 1 ^ 1.; 

resonance around 2.75 GHz, the 'I 5 r-r- v|f .J J^.. L.- 

TEM 4 resonance around 3 GHz and § * 20 rr f- "^v-w||~t--j-- ; 

the TE 114 resonance at 3.5 GHz show 5 -25 L . -JL .C-j-jf. . 1 

very geometry-dependent behaviour of -30 ! i ._— a40X <| 

the coupling, sometimes inverting the , 35 .j/AL.l..| v ^4i-iv.J. '--4 a70X °[ 

"normal" behaviour of the curves H If (j f\! | v . 1,04 4 

(TEi 14 resonance: 1.04 Xq separation 2 2.5 3 3.5 4 

compared to 0.4 Xo separation). Frequency /ghz 

The H-plane coupling around the Fig. 12: Coupling between z-polarized elements in a 1x2 
circumference of the cylinder depicted element array for z-directed element spacing of 

in fig. 11 shows the lowest coupling 0.4 Xq, 0.7 Xq and 1.04 Xq. 


Frequency/GHz 

Coupling between z-polarized elements in a 2x1 
element array for (p-directed element spacing of 
0.4 Xq, 0.7 Xo and 1.04 Xq. 
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circumference of the cylinder depicted element array for z-directed element spacing of 

in fig. 11 shows the lowest coupling 0.4 Xo, 0.7 Xo and 1.04 Xo. 

encountered during these 

measurements. The coupling drops very fast with increasing distance and is less affected by 
resonances. Only 2 resonances at 3 GHz (TEM 4 resonance) and 3.25 GHz (TE 113 resonance) 
occur, because the polarization (orientation of the slots) and the location of the slots are bad for 
coupling to the resonances in the coaxial cavity. 

The E-plane coupling along the planar z-direction is depicted in fig. 12. The coupling in this 
plane is stronger than in both H-planes but generally weaker than the E-plane coupling around 
the circumference depicted in fig. 9. Like in fig. 11 for the H-plane the z-polarization also 
shows in the E-plane less sensitivity for the resonances. Also here all the resonances (TEM 3 at 
2.45 GHz, TEM 4 at 3 GHz, TE 113 at 3.25 GHz and TE 114 at 3.5 GHz) show a geometry- 
dependence of the coupling. 


The different effects 


The dominant coupling effect between these cylindrical aperture coupled patches is the 
resonance, either in the coaxial cavity or as a circumferencial resonance in the patch substrate 
coupled to by surface waves. This effect is narrowband and geometry-dependent [2]. 
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The other more broadband effect is the 
propagation of surface waves, which 
causes the difference between E- and 
H-plane. Fig. 13 shows the nearfield 
of a ^-directed dirac current source 
(Green's Functions in space domain) 
to vizualize the propagation of a 
surface wave. The plot clearly shows a 
difference in magnitude of the nearfield 
between the direction parallel to the 
current and perpendicular to the 
current. The field and hence the o 
coupling is much stronger parallel to 
the source, which of course is the E- 
plane. 

In the cylindrical case this surface 
wave pattern in conjunction with the 
finite <j>-direction causes high coupling 
because the surface wave energy 
"radiated" in the (|)-direction can not be p ig 
lost, so it stays within the system and 
causes coupling. 

Other coupling effects (like radiation, 
etc.) are negligible. 

Conclusions 

The coupling phenomena within an cylindrical aperture coupled microstrip patch antenna are 
more complicated than in a planar microstrip patch antenna. As in the planar case the coupling in 
the E-plane is generally stronger than the coupling in the H-plane. Within both planes (E-plane 
and H-plane) the ^-polarization causes higher coupling than the z-polarization, because the 
energy is somehow trapped in the (^-direction. However, this effect is weaker than any 
coupling to a structural resonance, which dominates the coupling behaviour in the 
corresponding frequency range. 

Comparing the input reflection coefficient and the coupling of both polarizations the <J>- 
polarization is much more affected by the resonances than the z-polarization because the slot 
orientation of the ^-polarization is more "useful" for the coupling to the resonances. 
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Nearfields of a <fhdirected dirac current sources (Green's 
Functions in space domain) to vizualize the propagation of a 
surface wave. The calculation parameters are chosen to 
show the surface wave effect, hence only a single layer 
substrate is used with a inner radius r: kor - 10, a substrate 
thickness d: kgd = 0.2 and e r - 3.5. 
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Abstract 

Mission Research Corporation (MRC), in collaboration with Michigan State University (MSU) 
and Analytic Designs, Inc. (ADI) is applying unique, state-of-the-art computational 
electromagnetics (CEM) tools to investigate the performance parameters of wide bandwidth, 
multi-modal, conformal antennas. The antenna modeling is coupled with the CEM via a user- 
friendly graphical user interface (GUI). We will discuss our efforts to develop mesh 
generators for these types of antennas, the application of various finite element-boundary 
integral (FE-BI) engines for computation, and the interface through the GUI. We will also 
discuss the use of these special meshes with our FE-BI CEM tools to predict antenna 
performance parameters such as radiation pattern and impedance. This work was 
sponsored by AFRL/SNRP under contract # F33615-98-C-1223. 


Introduction 

The MRC Team’s FE-BI CEM tools permit rapid analysis and design of conformal antennas, 
with particular emphasis on wide bandwidth, multi-modal antennas. These antennas are 
cavity-backed and flush-mounted on planar, cylindrical, and doubly-curved surfaces. Our 
CEM tools are based on the finite element-boundary integral (FE-BI) method. This CEM 
technology is uniquely capable of modeling complex geometrical and material properties. 
Below we present a brief summary of the FE-BI method. 

Finite Element Boundary Integral (FE-BI) Method 

Hybrid finite element methods (HFEM) have been explored for several years, most often 
being called either the finite element-boundary integral (FE-BI) or the finite element-integral 
equation (FE-IE) method. Both combine finite element and integral equation techniques. 
Unfortunately, a general FE-IE method is not currently feasible due to heavy computational 
resource demand. In such a case, electric and magnetic currents are needed on the mesh 
boundary as well as the three components of the interior field. All IE formulations result in full 
matrices due to global coupling; hence, the two currents on the mesh boundary impose an 
excessive computational burden in terms of memory and compute cycles. Therefore, FE-IE 
formulations typically are restricted to specialized situations such as cavity-backed patches 
or slots recessed in an infinite metallic surface. In this case, the FE-IE method can be 
formulated such that it requires only one current (magnetic) over the apertures located within 
the ground plane or circular cylinder [1). Accordingly, the memory and computational 
requirements associated with the mesh boundary is minimized. In addition, very large 
apertures can be efficiently modeled by exploiting Toeplitz symmetry, present in the IE sub¬ 
matrix, provided these apertures lie in a metallic ground plane or circular cylinder and uniform 
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sampling is used on the mesh boundary. Figure 1 illustrates the application of the FE-IE (or 
FE-Bi) method. 

The MRC Team has been a leading innovator in custom FE-BI solutions for antenna analysis 
[2]. FE-BI is the only leading CEM technology capable of providing the geometric fidelity, 
efficiency, and accuracy required for demanding conformal antenna design and 
characterization. 



Figure 1: Application of FE-BI to a conformal antenna. Note the reduced computational volume. 
Anisotropic Planar FE-BI Computer Codes 

Our planar codes all use the half-space Green’s function and have basically the same 
capabilities given by: 

• Plane wave incidence 

• Filamentary feed models 

• Radar cross section calculation 

• Power gain calculation for radiation analysis 

• Voltage at feed pins due to plane wave reception 

• Input impedance calculation 

• Anisotropic materials 

• Resistive (R-card) treatments 

Anisotropic materials can be specified by fully populated symmetric or asymmetric 
permittivity or permeability tensors. 

Brick Elements: 

MRC developed and delivered a Fortran computer code called LM_BRICK. This FE code 
uses brick elements to discretize the volume as well as expand the unknown electric field. 
Validation has been performed by comparison with XBRICK results for a patch antenna for 
both scattering and radiation. 

Prism Elements: 

MRC developed and delivered a Fortran computer code called PRISM. This FE code uses 
right prism elements to discretize the volume as well as to expand the unknown electric field. 
PRISM also has the added capability to model superstrates and substrates. This code has 
been run on many of the antenna models and validated against existing data when available. 
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Tetrahedral Elements: 

MRC developed and delivered a Fortran computer code called TET. This FE code uses 
tetrahedral elements to discretize the volume and expand the unknown electric field. TET 
also extends the capability of PRISM by including the capability to model non-planar cavity 
bottoms. This code has been run extensively on the LPA and validated against existing data. 


Anisotropic FE-BI Cylindrical Computer Code: 

To investigate the effects of curvature, MRC developed and delivered a tetrahedral element 
code for a cylindrical geometry. This code uses a special cylindrical tetrahedral element and 
the cylindrical Green’s function for the boundary integral. This code is an improvement over 
the technique in [1]. 

Automatic Mesh Generators 

We have developed a stand-alone C language code for each type of antenna we are 
modeling. These codes are integrated with a graphical user interface (GUI) currently under 
design by ADI. The basis of the mesh generators is Carnegie-Mellon's (CMU's) TRIANGLE 
Planar Surface Mesher. It is written in C (source code is freely available), uses X-Windows 
interface for mesh viewing, and we currently have it running on our SGI and other machines 
using Linux. We have developed mesh generators for the log-periodic, the equiangular 
spiral, the archimedean spiral, the sinuous spiral, and the Vivaldi notch. We also have 
generators for rectangular and circular patch antennas. The antenna codes are actually 
mesh generator interfaces used to create the input files used by TRIANGLE. TRIANGLE 
creates the triangular mesh, based on the input parameters specified by the user when a 
specific antenna code is run. SHOWME, also developed by CMU, is used to view the 
resulting mesh. Once the mesh files have been generated by TRIANGLE, an MRC 
developed code (INTERFACE) is used to create an input file for PRISM, TET, or CYL_TET 
(our FE-BI codes). The FE-BI code is then used to calculate various electromagnetic 
parameters such as input impedance, radiation patterns, and far-field patterns. A discussion 
of these meshers can be found in [3] 

Our discussion will concentrate on the circular antennas. The associated polarization of the 
circular antennas is given below: 

• Circular log-periodic (linear polarization) 

• Spiral (circular polarization) 

• Sinuous Spiral (dual circular or linear polarization) 


Log-Periodic Geometry: 

MRC developed a mesh generator for the Log Periodic Antennas (LPAs) with TRIANGLE. 
We can model a 4-arm LPA with any desired number of teeth. An example of a 4-arm LPA is 
shown in Figure 2. 
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Figure 2: Log Periodic geometry and associated mesh 
Spiral Mesh Generators 

Mesh generators have been developed for 2-arm, 4-arm, and 8-arm equiangular and 
archimedean spiral antennas. We have also developed a mesh generator for a 4-arm 
sinuous spiral antenna. The user can specify typical spiral antenna attributes such as the 
antenna flare rate, maximum amount of rotation of the spiral arms, and initial distance 
between the antenna arms. Figure 3, Figure 4, and Figure 5 show examples of the various 
spiral antenna models and their associated meshes. 



Figure 3: Equiangular spiral geometry and associated mesh 



Figure 4: Archimedean spiral geometry and associated mesh 
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Figure 5: Sinuous spiral geometry and associated mesh 
Graphical User Interface (GUI) 

The GUI is being designed and implemented by ADI. It has been originally designed to run 
on SGI machines; however, we are also porting it to Linux machines. It consists of various 
pop-up screens that guide the user through setting up the geometry, generating the mesh, 
selecting the appropriate FE engine to solve the problem, provide input to the selected FE 
engine, and then provide post processing. Figure 6 shows the initial pop-up window. 
Selections are made from here. 



Figure 6: Initial Pop-up window. Selects geometry, mesh generation , FE engine, and post 
processing 

Next, we pick “Spiral” to generate a spiral antenna geometry. From this window, another 
window is displayed that enables selection of the various antenna parameters. This is shown 
in Figure 7. 
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Figure 7: Pull down menu for constructing a spiral geometry. Select spiral parameters. 

Next, the mesh needs to be generated. Figure 8 shows the inputs available through this 
window. The geometry can also be viewed. Figure 9 shows an example of a spiral antenna. 



Figure 8 Mesh menu. Mesh parameters are input here. 



Figure 9: View Geometry button. Displays geometry with “Show Me” 























Antenna Performance Prediction 

We have discussed our efforts to develop an antenna design suite consisting of the antenna 
mesh generators for wideband, multi-modal antennas, choice of FE-BI techniques, and post 
processing options. The meshes enable accurate and efficient modeling of the performance 
for these antennas using the FE-BI method. These are all combined into a user-friendly GUI 
to make a useful tool engineers can easily use to determine design parameters that optimize 
performance of these antennas to solve real-world problems. 
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Abstract 

The impact of curvature on the resonant input resistance of a conformal circular log- 
periodic antenna was investigated using the finite element-boundary integral method. 
The antenna was modeled as conformal to the surface of a perfectly conducting, infinite 
circular cylinder. The curvature of the cylinder was varied and the input resistance was 
calculated over a frequency range of 6-20 GHz using a finite element-boundary integral 
program. The behavior of the input resistance with respect to curvature was compared to 
that of a similarly mounted conformal square patch antenna. From this investigation, it is 
evident that the resonant input resistance of the log-periodic antenna is invariant with 
respect to changes in curvature. 

I. Introduction 

Log-periodic antennas belong to a class of broadband antennas for which the 
performance, as defined by the radiation pattern and impedance, is practically 
independent of frequency over a large bandwidth. This characteristic is achieved through 
the specification of an antenna geometry, such that the electrical characteristics of the 
antenna varies periodically as the logarithm of the frequency [1]. The motivation for this 
study lies in the fact that designers may sometimes find it necessary to place conformal 
antennas on the curved surfaces of aircraft, satellites or other types of vehicles. The effect 
of the supporting platform on the input impedance must be considered for matching 
purposes. 

In this investigation, the finite element-boundary integral (FE-BI) method was utilized to 
calculate the input impedance of the antenna. The FE-BI method is versatile in that it 
permits the specification of anisotropic materials within the computational volume and is 
capable of modeling structures with both metallic and dielectric components as well as 
arbitrary shape. The FE-BI method utilizes a boundary integral to terminate the finite 
element mesh by relating the tangential electric and magnetic fields on the mesh surface. 
The FE-BI equations are derived from the weak form of the vector wave equation and are 
given as follows [2]: 


JJvxW, V • VxWj]dV—kj J v [w, I,-wJdV+jk^ 
"k l J s J s [Wj zx5 e2 xz-W j ]dS'dS = f”’ +fT 


(%x Wj-^xwJ 


The first term is associated with the magnetic field, the second term is associated with the 
electric field, the third term accounts for resistive transition conditions, while the last 
term on the left-hand side is the boundary integral term. The two terms on the right-hand 
side are functions of the interior and exterior excitation, respectively [3]. Note that 
Galerkin’s testing procedure is utilized, whereby the basis function Wj is functionally 
identical to the test function Wi. Also note that the boundary condition on the metallic 
surface is implicit in the dyadic Green’s function of the second kind used in the boundary 
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integral term. This formulation 
obviates the need for an equivalent 
electric current, thereby reducing the 
number of unknowns and associated 
computational cost. 

f 

II. Modeling and Simulation g 

A square patch antenna conformal to £ 
the surface of an infinite conducting 
cylinder was modeled using the FE-BI e 
computer model described above. The 
dimensions of the square patch 
antenna are as follows: 2.5cm x 2.5cm 
metal patch with a 5cm x 5cm 
aperture and a .0762cm substrate 
thickness. This patch is illustrated in 
Figure 1. The patch antenna is fed 



Figure 1. Patch antenna and the feed 
points for circumferential and axial 
polarizations. 

along a centerline of the antenna and 
three rotational states are considered: 

0°, 45°, and 90°.This corresponds to: 
axial, mixed, and circumferential 
polarizations. The input impedance ^ 
about the resonance for this antenna is § 
illustrated in Figure 2. As shown, the — 
input resistance’s dependence on the J 

feed location is determined by the | 

rotational state of the antenna. For 
axial and to a lesser extent mixed 
polarizations, the value of the input 
resistance at resonance depends on the 
cylinder curvature. On the other hand, 
for circumferential polarization, the 
input resistance is invariant with 
respect to cylinder curvature. Such a 
dependence is described elsewhere, 



25 26 27 22 3.0 3.3 5-2 3.3 3.5 3.6 V? 


Requeue^ (GHz) 


Impedance vs. Frequency (45deg. rot) 



’ 25 25 27 29 3.0 3J 32 3.3 33 3.6 3.7 


Frequency (GHz) 

Impedance vs. Frequency (chcumfeiential-pol) 



25 2.6 27 2.9 3.0 3.1 32 3.3 3.5 3.S 3.7 


Fioqucacy (GHr> 

Figure 2. Resistance of a patch antenna for three 
different rotations on a cylinder: 0,45, and 90 degrees. 
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though not for the mixed polarization case [4]-[5]. 

The consequence of the rectangular patch antenna’s potential dependence on rotational 
state is a practical one. Care must be taken in the construction of arrays of such antennas 
and in the mounting of the array on a curved surface so that a good match to the feed 
network is obtained. 

Such considerations motivated the subject of this study: the dependence of circular log- 
periodic (LPA) antennas on rotational state. The LPA was chosen since among all wide 
bandwidth, multi-modal antennas in practical use, it appears to be the most likely 
candidate for curvature dependence. This is due to its relative asymmetry with respect to 
rotation as compared to spiral, sinuous spiral, and other multi-mode antennas. A 
complementary four arm LPA has several distinct rotational state possibilities. Three of 
these are illustrated in Figure 3: 0, 22.5, and 45 degrees. 



Case 1 Case 2 Case 3 

Figure 3. Three orientations of the LPA on a cylinder: 0, 22.5, and 45 degrees. 

For this study, the different rotational states can be considered as points on a compass. 
For the 0 degree case, the metallic arms of the LPA are aligned with the N-E-S-W points 
on a compass and the apertures align with NE-SE-SW-NW points on the compass. The 
45 degree case is the complement (e.g. the metal arms align along NE-SE-SW-NW and 
the apertures align along N-E-S-W points). The 22.5 degree case is part way between the 
other two cases. 

For this investigation, the resonance behavior of a four-arm, log-periodic antenna was 
analyzed as the curvature varied. The antenna was modeled as conformal to the surface of 
an infinite conducting cylinder. The following geometrical parameters describe the 
antenna: a=67,5°, p=22.5°, x=0.80, Ro = 0.129 and Ri = 0.3 as described in [6]. A 
homogeneous cavity fill of permittivity e r = 3.2 was used through out this study. An 
infinite circular cylinder provides a computationally tractable representation of the types 
of curved structures on which the antenna could be mounted. A tetrahedral mesh was 
used to described the cavity volume. The impact of curvature on the input resistance for 
three distinct orientations was studied. As illustrated in Figure 3, each case represents a 
distinct rotation of the antenna with respect to the vertical cylinder axis. The excitation 
frequency range used in this study is 6-20 GHz. The antenna was excited at four feed 
points but, for the sake of brevity, only the results at two representative feed points (in 
case 1, the North and East arms) were presented in this paper. The curvature was altered 
by changing the cylinder radius (e.g. p=100cm, 20cm, 10cm and 5cm). The input 
resistance for mode 1 (progressive phase shift of 90°) versus the excitation frequency at 
each feed point was calculated using a FE-BI program and plotted, as shown in Figure 4. 
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Figure 5. Input resistance for various curvatures measured at the East 
feed. 


As seen in Figures 4 and 5, the input resistance for the antenna arms (for case 1) aligned 
along the North and East axes is invariant with respect to curvature. Due to the 
symmetry in this complementary LPA design, we may extend those conclusions to the 
South and West feeds as well. 




























HI. Conclusions 

We conclude that the reason for this invariance lies with the operational mode of a 
circular LPA. As a wide bandwidth antenna, radiation at different frequencies can be 
attributed to different portions of the antenna. The antenna is fed near the center of the 
antenna. Electromagnetic fields are guided by a tapered microstrip transmission line until 
a quarter-wave stub is encountered. At that point, radiation occurs. Accordingly, high 
frequency radiation emanates near the center of the LPA while low frequency radiation 
occurs at the extremity of the antenna. The curvature of the cylinder increases as the 
frequency increases (this is the fundamental reason the patch antenna can be sensitive to 
curvature); hence the high frequency fields “see” a ellectrically larger cylinder than do 
the lower frequency fields. For the patch antenna, the input resistance depends on the 
normal fields beneath the patch and these are different for axial and circumferential 
polarization. Indeed, it can be shown that the normal fields beneath a circumferentially 
polarized patch are invariant with respect to curvature [4]. However, unlike the patch 
antenna whose input resistance depends on the field underneath the resonant antenna, the 
LPA’s resistance is invariant to curvature. The reason is likely due to the fact that the 
resonant structure (the quarter-wave stubs) are distant from the feed point. Note that the 
resonant stubs do appear as loads to the microstrip line and result in resonant lines in the 
spectral responses shown in Figures 4 and 5. Further investigation is warranted to tighten 
our understanding of this curvature invariant phenomena. 

In this paper, we investigated the impact of curvature on a popular wide bandwidth multi- 
mode antenna: the circular log-periodic antenna. We found, unlike the case of a patch 
antenna, that the input resistance is not dependent on the curvature of the antenna itself. 
This has important practical implications since it permits the design of such antennas 
using efficient planar design methods. It also relives the antenna integration team of the 
responsibility to carefully align these antennas on the supporting structure (e.g. aircraft, 
automobile, etc.). 

The conclusions reached in this study are for shallow cavity (substrate only) circular log- 
periodic antennas excited in mode 1 (total phase shift around the antenna of 360°) [7]. 
Further investigations will be undertaken to extend our conclusions to more general 
cases. 
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1.0 INTRODUCTION: 

This paper explores some of the problems encountered when attempting to accurately calibrate ship DF arrays, 
particularly at HF, and for skywave signals. 

Traditionally, a ship array response has been calculated using the geometiy of the array, suitable positions on 
the superstructure have been allocated, then the whole arrangement is calibrated by sailing the ship in a circle 
and taking bearings on signals of known azimuth (elevation was not considered). For several years, copper 
scale models characterised on a model range have been seen as an appropriate substitute for the full-scale 
platform, and used as a ‘first step’ generating the calibration data. More recently, work has been done to 
evaluate the validity manifolds generated from CAD wireframe models. 

The object of the research discussed in this paper is to evaluate the validity of copper and NEC (Numerical 
Electromagnetic Code) array manifolds. Ideally we would like to make wireframe models so accurate that the 
manifold for a ship DF array can be accurately predicted using software alone. This would enable quick re¬ 
calibration of an array where a modification has been made either to the array, or to the surrounding platform. 

2.0 METHODS FOR GENERATING DF MANIFOLDS: 

A manifold is a data set consisting of a collection of steering vectors that characterise the response of the DF 
array. Manifolds can be generated by geometric calculation, measurement (of the full-scale system or of a scale 
model) or by electromagnetic simulation. 

The more data points that are collected (increments in azimuth, elevation and frequency, plus orthogonal 
polarisations) the more accurate the potential DF response of the array, however, the time required to 
characterise the array increases accordingly. To represent an array over the entire HF band, it is recommended 
that the frequency step size be around 5% of the centre frequency, with closer frequency spacing in the region 
of significant platform structural resonance. The required steps in azimuth and elevation can be assessed 
according to antenna beamwidths and the algorithms to be used, but are typically of the order of 2° and 3° 
respectively. The step size of all parameters is determined with reference to the required DF accuracy of the 
system, the algorithm to be used, and the environment stability. 

2.1 Geometric Manifold: 

This is simply a description (in phase) of where the elements of the array are physically with respect to one 
another. All elements are assumed to have identical radiation patterns being identically aligned. 
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2.2 Full-scale calibration: 


In other words, sailing the ship in a circle within radio range of transmitters whose position is precisely known. 
Arguably, this is still the most accurate method of calibrating the array, although it is not practical to take data 
points above low elevations. 

23 Scale models: 

Scale models have an advantage in that the calibration signal can be accurately controlled in azimuth, elevation 
and polarisation. There are problems associated with obtaining high elevation test signals in the far field, and in 
the timescale required to collect sufficient measurements to characterise over a full half-hemisphere. There are 
also difficulties in assessing how accurately the model represents the full-sized array. 

2.4 Computer models: 

Computer simulated manifolds produced by analysis of wireframe CAD models are a recent method of 
generating array manifolds. At present, producing a complex wireframe model with sufficient detail to 
represent a cluttered platform is time intensive. Running the segmented model through a package such as NEC 
to produce a manifold at the appropriate frequency of interest is processor intensive. However, once a 
wireframe model for a particular platform has been produced, it can be modified and re-used with little extra 
effort. Similarly the run-time required to produce the manifold for the arrays should reduce rapidly as computer 
processing power and speeds increase. The main aim of the work described in this paper is to assess the validity 
of NEC manifolds in representing the full size array. 

3.0 WORK DONE: 

The procedure for collecting the data used to characterise the manifolds investigated is discussed in this section. 

3.1 Full-scale array measurement: 



Figure 1 - The full-scale array - Photograph of the Fraser building with DF array mounted. 
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For the full-size platform, an array of 10 loops were mounted on a building at DERA Fraser (8 of which were 
used). The building was modified with steel strapping so that it better emulated the conductivity of a ship hull. 
The site was chosen because it faced directly onto a beach, and signals used in the trial propagated over 
seawater. Signals were transmitted from a boat and from transmitters suspended by balloons flown from the 
boat, at a range of 1500m (far field) from the array. The positions of the transmitters were checked with GPS 
and by optical means (theodolite). Measurements were taken for the responses of the array elements over 7 
trials frequencies (between 2MHz and 26MHz). The manifold used for the full-sized structure was simply the 
geometry of the elements in the array, and was only defined in one polarisation. The signals measured were 
groundwave and expected to be vertically polarised. 

3.2 Copper Model: 

A 1:50 scale copper model of the Fraser site was constructed. This carried detail relevant to the highest 
frequency in use, and included site features likely to affect the responses of the elements in the array. The 
maximum (full-scale) distance between elements on the array was 35m, the overall site was some 150m in 
length. This allowed scope for investigation into how much the surrounding structures contributed to the 
aperture of the array. The copper model array manifold was populated for the 7 frequencies used in the Fraser 
boat trials, and in two orthogonal polarisations (H and V). The steps in azimuth and elevation were 2° and 3° 
respectively. 

In addition to the boat trials data, signal data was also collected on an 8-channel DF rack (synchronous 
measurement) connected to the scale model. The signals were transmitted from the probe antenna on the model 
range boom. This ensured that the signals generated were of controlled DOA and polarisation, and had the 
same degree of curvature to the wavefront as the calibration measurements. 



Figure 2: Photograph of the 1:50 scale model of Fraser on the model range turntable. 

3 3 NEC models: 

In the first instance, a very simple NEC model was generated for comparison purposes (using the 7 frequencies 
for the boat trial data). The model only represented the building carrying the array, with the loop positions 
defined as per the geometry. It was not anticipated that this would accurately emulate either the full-scale array 
or the copper model, since the major structures on the site (such as the radar towers) were omitted. 
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Figure 3: Simple wireframe model used for initial NEC manifold. 

A complex wireframe model was then constructed which contained all the detailed structure of the full Fraser 
site. The model was segmented to produce an output file to allow NEC to generate a manifold at the chosen 
frequency. The higher the frequency the manifold is being calibrated for, the more segments that are required 
(smaller wavelengths). Extra detail (more segments) were incorporated around the antenna elements. 



Figure 4: The 6500-element wireframe model used for detailed NEC simulation manifold. 

The number of segments required for a detailed structure can be prohibitive. The full wireframe model 
segmented for 20.99MHz amounted to 6500 segments (using A/10 segments). The wireframe model was 
initially reduced to around 5000 segments by stripping off the fence line, as it was felt that such a degree of fine 
grain structure (not significantly close to the array loops) was probably unnecessary for the manifold to be 
accurate. The 6500 segment version ran successfully, after 50 hours and with an upgrade to 1Gbit of RAM on 
the PC, (8 hours per frequency). Both manifolds were used to assess the impact of the fenceline on the 
modelled array response. 
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A manifold was produced in the far field at 20.99MHz, for use with the boat trials tracking data. A manifold 
was also produced at 16.019MHz, using a signal source specified at a distance equivalent to that of the probe 
from the copper model (around the near field boundary) for use with the data collected on the model range. 

Normally a NEC generated manifold assumes ‘true’ far field, using a planar wavefront excitation. However a 
facility exists which allows for signal source range to be specified. Using the ‘near field request’ a curved 
wavefront can be simulated. In theory, the range of the original boat trial could be modelled as a ‘near field’ 
request, as the distance of the signal source is not infinite, although it is in the far field. 


4.0 ISSUES TO CONSIDER: 

4.1 Problems with manifold comparison: 

Even if the copper model were to perfectly represent the full-scale site, the manifold would contain fine grain 
structure not present in the geometric manifold. This would give rise to differences in the DF results using the 
two manifolds, even without errors in the model calibration process. 

The copper model manifold is defined in two orthogonal polarisations, whereas the Fraser (geometric) manifold 
was not. Therefore a direct comparison of DF accuracy between this and the scale model manifold is of reduced 
value. The trials data was anticipated to be vertically polarised (groundwave over 1.5km) so both manifolds 
were assessed using MUSIC, with vertical polarisation selected in die case of the ’copper' manifold. 

Initial analysis of the tracking data collected at Fraser suggests mode conversion of the signals arriving at the 
loops. This cannot be accurately evaluated, as there may be phase errors in the definition of the 'copper' 
manifold masking or diluting the effect. 

4.2 Problems with the full-scale platform: 

The manifold for the array, being simple geometric calculation, assumes that the elements have identical 
radiation patterns, all aligned in the same direction. Analysis of the trials data demonstrates that the Fraser 
geometric manifold sufficiently described the array to give good results in azimuth, but gave less accurate 
results in elevation. Electromagnetic modelling of the array demonstrated that the loops mounted on the Fraser 
building had similar, but not identical radiation patterns. A geometric manifold would thus be inappropriate in 
the case of an array with elements having dissimilar radiation patterns. 

43 Problems with the model range: 

The calibration process used for the scale model manifold in our investigation was initially found to be 
inconsistent. Several factors contributed to this, the two main factors being environmental effects and 
inappropriate measurement procedure. 

4.3.1 Errors due to environmental effects: 

■ Instability in the probe position (due to gusts of wind flexing the boom arm). A movement of 15cm at 
1GHz is equivalent to X/2 , so introducing significant phase error into the measurements. 

■ Changes in temperature (particularly for long cable runs). For measurements taken over a protracted time 
period, some change in temperature is inevitable. It has been found that temperature variations can induce a 
corresponding phase variation in the measured signal. 
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■ Rainfall/humidity inducing pronounced phase swings in the measured signal. Over a slightly longer time 
scale, a change in the water-content of the surrounding ground can also give rise to errors (where the 
conductivity of the ground-plane is significantly altered). 

43.2 Inappropriate measurement procedure: 

These give rise to die most avoidable errors; problems encountered include the following: 

■ Facilities that allow for only one channel to be measured at a time. This implies that any changes in the 
environment from one channel to the next (or frequency, or polarisation to the next) causes fluctuations 
specific to that one element response. If channels are calibrated separately, the elements are inherently 
being characterised with different calibration sources (with subtly different DOA, no matter how carefully 
the measurements are arranged). 

■ Any change to the position of the model. This is a practical limitation for any calibration incorporating 
curvature of the wavefront. Any change in the centre of rotation of a scale model during a calibration run 
(such as incorrect repositioning after vital maintenance) will cause errors in the manifold. 

■ Alignment of the ‘origin’ of the array to true north, or 0° in Azimuth. Errors in aligning the model in this 
plane will cause an error in DF azimuth (which should be fairly uniform throughout the manifold). 

■ Not including a reference probe on the model. The use of such a probe would eliminate fluctuations in the 
signal level in the calibration path, such as those due to environmental change or drift in equipment 
parameters. 

433 Curvature of the wavefront: 

There is a fundamental problem where calibration for skywave is concerned. For groundwave or low elevation 
signals, adequate far-field measurements can be taken at all frequencies (providing the site has a large enough 
groundplane and is uncluttered). However, for elevations above 10°, using a boom and turntable arrangement, it 
is not feasible to produce sufficiently far field signals for arrays of an appropriate aperture (except at the lowest 
frequencies of interest). A manifold calibrated this way is potentially less accurate with increasing elevation 
(depending on the height of the model). If a signal source is at true far field, the array is seen as a single point, 
and the centre of rotation is irrelevant. The greater the proximity of the probe to the model, the greater the 
phase errors across the aperture of the array. 

Calibrating for skywave significantly increases the measurement time (by a factor of x30 if increments of 3° in 
elevation are used). 

Using a dedicated near field arrangement (using far field conversion techniques) was not an option for our 
study as there were no facilities available catering for the lower frequencies of interest at HF. 

4.4 Problems with the NEC models: 

NEC solves Maxwell’s equations by a method of moments for a collection of wires or surface patches excited 
by voltage sources or an incident plane electromagnetic wave. In our analysis we used wire grids to model 
conducting surfaces. 

The method is limited by the number of moments used and by the approximation on how currents flow. Wires 
are treated as one-dimensional structures, and NEC assumes that currents only flow axially. This does not take 
circumferential (or radial) currents into account, and must give rise to some modelling error. 
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The more detail that is used to describe an array (the array here being defined as the whole structure, with the 
elements being regarded as excitation points on the array), the more accurate the manifold. However, the more 
detail that is used, the more computationally intensive the process. The NEC manifold produced is equivalent 
to a matrix of all the element radiation patterns for the array (each taking the overall structure into account). 
Obtaining accuracy in the magnitude of the radiation response is relatively straightforward. The phase is less 
straightforward, and an accuracy of within 5° of the ‘truth 5 is suggested as a desirable goal. 


4.6 Problems with Skywave: 

In addition to the ‘near field’ problem of the model range, there was another factor causing the response in 
elevation to be poor. The DF response (not the accuracy) for all of the manifolds under investigation was worse 
for elevation than for azimuth. This is a function of the shape of the array being used, having aperture much 
shorter in elevation than in azimuth. Therefore the array had ‘deaf response to skywave, having poor resolution 
in elevation. This problem will affect all arrays of similar profile. 


5 EVALUATION OF THE MANIFOLDS OBTAINED: 

The evaluation of the manifolds produced is not straightforward. Essentially this was a 3-way comparison, 
initially with no common reference. The methodology employed was as follows: 

* Compare geometric results of the far field Fraser data with that of far field NEC models. This gives an 
indication of the difference between the simple wireframe model and the simple geometry, and then 
between the simple and the complex wireframe models (without introducing the factors encountered on the 
model range). 

■ Compare the manifold obtained from the copper model with the detailed NEC manifold stimulated with 
corresponding curved wavefront. This compares the NEC model to the copper model without introducing 
phase errors due to the proximity of the boom probe to the model. This is to gain an indication of how well 
the NEC model emulates the copper model, having a similar level of complexity and overall aperture. 

■ Compare the manifold from the same (detailed) NEC model excited with a planar wavefront and then with 
a signal defined at appropriate 'near’ source. This is to gain an appreciation of the degree of DF error 
induced by the curved wavefront. 

■ The validity of the copper model in representing the full-scale Fraser site may then be inferred, on the basis 
of eliminating the factors that make direct comparison impractical. 

A Matlab evaluation tool (written by Roke Manor Research Limited) was used for the manifold comparison. 
The software was modified to allow loading of externally generated manifolds. The manifolds were then tested 
using a DF algorithm and signal data obtained in the boat trial or on the model range. Manifold interpolation 
routines are required to permit DF processing of signals recorded at different frequencies to the manifold data. 

6 FINDINGS: 

6.1 The geometric manifold: 

As the Fraser array had elements with fairly similar radiation patterns, the geometric array was expected to 
accurately predict the response of the array. This would be less true for any signal ‘endfire’ to the array or 
originating from behind the building. All of the trials data was generated within a 150° arc mainly broadside to 
the array. 
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6.2 The simple NEC manifold: 


The very basic NEC model was derived from the geometry of the array this was also anticipated to give a 
reasonable DF response to the trials data. As the trials data was groundwave, and thus single (vertical) 
polarisation, this manifold gave good DF results using a single polarisation (MUSIC) algorithm. 

The ‘basic’ NEC manifold performed less well with skywave signals generated on the model range (using both 
polarisation sensitive and insensitive algorithms). This is probably because the signals collected with the copper 
model would be affected by the local scattering from the detailed metal structures surrounding the array 
elements. The more complex NEC manifolds produced later took the surrounding structural detail into account, 
and performed better with the range data. 

6.3 The copper model: 

There are problems in assessing whether the copper model accurately emulated the conductivity and reflectivity 
of the Fraser site. The reflective surfaces around the elements produce local scattering (effectively multipath), 
which widen the beamwidth of the array. Although measures were taken to make the Fraser building emulate a 
conducting structure at the frequencies of interest, there was no way of quantifying the accuracy of the 
representation. 

6.4 The copper model manifold: 

Using the trials data, the geometric and the ‘basic’ NEC manifolds compared quite well, whereas the copper 
model manifold gave a very different response. This indicated that there were problems either with the copper 
model or with the way it was characterised. On the main Fraser site (and on the models), there was a large 
tower within a few meters of the end of the ‘array’. At one of the trial frequencies the tower height represented 
a quarter wavelength, and this was expected to alter the phase centre of the array. Measurements were taken of 
the copper model with the tower present and with the tower down. While differences between the two 
manifolds produced were noted, the repeatability of the original manifold (‘tower up’) was found to be suspect. 
Thus reliable conclusions could not be drawn from the data. 

6.5 Model range phase stability with temperature. 

To investigate the apparent phase instability of the copper model measurements, an attempt was made to assess 
the impact of the environmental variations. A test was conducted whereby a 1/4 wave monopole was placed on 
the turntable, and a fixed 1GHz signal was monitored over a 24 hour measurement period. 

During the experiment the phase of the measured signal was seen to suffer an apparent diurnal fluctuation of 
around 40°. The phase variation appeared to roughly track the measured ambient temperature. The exposure of 
sections of cable in the calibration path to the elements (sunlight etc.) was found to be a significant factor. In 
addition, it was found that rain induced a more dramatic phase swing (possibly due to a combination of greater 
cooling effects and humidity between the channel connections), a variation of 120° being recorded at one 
instance (see figure 5). Note that the smaller fluctuations seen on the trace corresponded to clouds obscuring 
the sun. The substitution of a phase-stable coax on the boom significantly reduced this effect, as demonstrated 
in the second sweep plot (see figure 6). Note that the second run was conducted on a windy day, with no direct 
sunlight (but with no rainfall), so sharp fluctuations shown on this trace give an indication as to the phase 
excursions induced by wind gusts (2° worst case, +0.5° typical), with some underlying temperature effects still 
evident. Further experimentation revealed that errors due to boom movement in a gusting 15-knot wind over a 
short duration (i.e. no temperature variation) measured ±0.5° at 1.2GHz. 

Until recently, the model range facility was used to characterise individual elements and their coupling to the 
surrounding platform. This involved measuring field strengths and producing amplitude patterns, thus a 
deficiency in the phase stability of the system would not be readily apparent. 
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Figure 5: Phase drift of original calibration cable 
over a 24-hour sweep (with temperature). 
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Figure 6: Phase response with phase-stable 
coax in calibration path 1 


6.6 Detailed NEC manifolds: 


Using the signal data collected at Funtington range (controlled in polarisation and DOA), the analysis of the DF 
response of each of the 3 NEC models indicates that the more complex models give the greatest accuracy. This 
is expected since the ‘basic’ NEC model takes no account of the surrounding structures, and does not contain 
any fine grain structure (such as the metal staircase near element 5). 



Approximate postion 
of Flagpole 

X 

Figure 7: Polar element radiation patterns in ‘NEC* array at 7.556MHz (flagpole resonantl. 
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The inclusion of the fence line in the model gave better results than its omission, which implied that this 
structure had significant influence on the aperture of the array. It was also noted that at one of the test 
ffequencies_(7.566MHz), a fairly insignificant structure on the site had a pronounced influence on the response 
of the array. A small flagpole (9.5m in height) is visible to the south of the main array on the Fraser site, this 
appears as a slim vertical line on the wireframe model (see figure 4). A structure of 9.93m in height represents 
XJ 4 at 7.556MHz and would be resonant. Figure 7 shows a set of polar plots for the elements, as they appear on 
the array (viewed from above) and their response to a groundwave (vertically polarised) signal. The 
approximate position of the flagpole in relation to the array is marked ‘X’, and nulls are evident in the radiation 
patterns of the elements that appear to point towards the resonant structure. 


7 CONCLUSIONS: 

Measurement errors and process errors on the model range invalidated the copper model manifold in this 
instance. Most problems arose from the fact that the model range was principally designed to characterise 
single antennas and their coupling to the surrounding structure, or arrays of small physical dimensions. The 
phase-instability of the original cable to the boom probe was a major source of measurement errors. 
Synchronous measurement of all elements on an array is desirable at any one frequency, as it eliminates errors 
due to environmental changes, or fluctuations in the probe position. Where channels are characterised 
separately, they are not truly calibrated for the same incoming signal. 

Where a large copper model is used on a range with a boom and turntable arrangement, the centre of rotation of 
the model becomes significant due to curvature of the wavefront. Although the phase taper across the array may 
not cause gross DF errors, moving the centre of rotation between measurements will cause serious errors in the 
manifold. Copper models (characterised with appropriately controlled facilities) are still useful from the point 
of view that test signals can be generated for any azimuth, elevation or polarisation. 

At the wavelengths corresponding to the low end of the HF band (around 100m), a large aperture is required to 
achieve reasonable DF accuracy. In addition, larger arrays give greater resolution at a given frequency, 
allowing discrimination between close proximity signals. Where the aperture of an array becomes large enough 
to pose problems for scale modelling, computer-generated manifolds may prove the only practical solution. 

During the course of this work, a polarisation sensitive algorithm was assessed in parallel with the manifolds 
under investigation. Where an array has elements with diverse radiation patterns the response of the array will 
span the polarisation space. Characterising an array in two orthogonal polarisations enables signals of any 
polarisation to be captured and thus be available for DF. 

At present, the run-time for a highly detailed wireframe model at a single frequency is comparable with the 
measurement time required on a model range, however, this will reduce as computer processing speeds 
increase. Computer simulated arrays offer flexibility, and once mature, will provide a cost effective and fast 
calibration tool where modification of an array or platform has occurred. The natural progression to this is work 
is to use computer-generated manifolds to compare the performance of various DF algorithms under certain 
situations. For example, with correlated multipath (typical of HF), short duration signals (giving insufficient 
samples for HOS) or multiple target signals over a wide range of SNR. 

The traditional view of the array as a set of elements with a geometric relationship is challenged where non¬ 
identical elements and reflective structures are involved. On a model range for example, it is not sufficient to 
measure the individual element responses and post-process for the phase relationships. The phase centre of the 
array is suspect where a resonant structure is close to the array, and the overall aperture of the array is altered. 
The array should be viewed as a single antenna with various ‘excitation points’. 
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DESIGNING A VHF WRAP-AROUND DF ANTENNA ARRAY USING NEC 

Keith Lysiak, klysiak@swri.edu 
Southwest Research Institute, San Antonio, Texas 


ABSTRACT 

The Numerical 'Electromagnetic Code (NEC) was used to design a VHF wrap-around DF 
antenna array. A prototype array was constructed and extensive antenna response measurements were 
made at the SwRI antenna range. The measured responses matched the numerically modeled results 
very well with an overall mean correlation value of 0.986. NEC also produced beamformer 
characteristics very similar to the modeled characteristics. The numerically modeled results were used 
to produce array manifolds to estimate DF performance. An overall DF performance of less than 1 
degree RMS error was obtainable using the NEC generated array manifold. This is a first order 
indication that the array design is very robust. 

INTRODUCTION 

The design of direction-finding (DF) antenna arrays is a complex task. Numerous factors can 
affect DF performance. In complex scattering environments, accurate information about the 
interaction of the antenna elements with the surrounding environment is crucial to determining DF 
performance. Most DF systems are required to operate over a broad frequency range which means 
that structural resonances within the band of operation are important. These resonances can 
significantly distort the antenna response patterns and decrease the array performance. Fortunately, a 
few key characteristics of the array beam response can be used to estimate the DF performance. 

A DF antenna array design was required which could be mounted on a wide variety of tower or 
mast types ranging from telephone poles to water towers with minimal modifications. An adaptable 
fool-proof mounting design was desired that would simplify correct installation. Two performance 
characteristics of primary interest were the mainbeam width and maximum sidelobe level of the 
beamformed response. The array design was developed using the Numerical Electromagnetics Code 
Version 4 (NEC4) [1]. Numerical modeling can be a powerful tool to accurately calculate the 
individual antenna element responses and beamformer response characteristics. The array responses 
calculated by NEC were processed using a MATLAB DF algorithm to produce beamformer response 
patterns. Statistical analysis of these patterns provided the root-mean-squared beamwidth and 
maximum sidelobe level over all azimuths. By minimizing these two beamformer response 
characteristics, a optimal and robust array was designed. 

The best numerical design was used to build an experimental array for testing. The mounting 
structure was adjustable so that data for a number of geometrical configurations could be acquired. 
This provided an increased confidence in the NEC modeling by having numerous array geometries for 
data comparison. It was hoped that in the future, NEC could be used to design and predict the 
performance of unique installations eliminating the need for individual site measurements. 


790 




The numerically modeled data for the DF arrays was compared to experimental data in several 
ways. Individual element patterns were compared as magnitude and phase or real and imaginary 
components. The element responses were processed using a beamforming DF algorithm to predict 
beam response characteristics. And finally, the experimental data was cross-correlated with the 
numerically modeled data to provide a complete comparison between the experimental and 
numerically modeled array responses. 

ARRAY DESIGN 

The objective of this project was to design a wrap-around VHF array with robust DF 
performance that would permit installation around a wide range of tower types. This development 
permits the installation of a DF array in an environment in which clear access to the top of the tower is 
not available. The primary focus was on triangular structures such as standard Rohn towers, but the 
analysis technique is also applicable to a variety of mountings including ship masts. After an initial 
survey (using numerical modeling) of different numbers of antenna elements in circular and triangular 
arrangements, a triangular arrangement of nine elements was chosen. Numerical modeling of this 
array placed on a Rohn G-45 tower section indicated very good DF performance. Figure 1 shows the 
numerical model. 



Figure 1 Numerical Model of Wrap-Around Array 
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EXPERIMENTAL ARRAY CONSTRUCTION 


An adjustable nine-element array based on the numerical model was built for measurements at 
the SwRI range. The experimental array can be adjusted to match the nine configurations modeled 
with NEC. There were two primary objectives for the measurements. The first was to prove through 
experimental data that the array was capable of performing well over a range of radii and element 
separations. The second was to show a close correlation between the numerically modeled and the 
experimental data indicating that the numerical model could be used to predict the DF performance for 
other configurations. This will eliminate the need to perform measurements for each array installation. 
The antenna consisted of nine matched dipole elements arranged on three sides of a triangular array. 
Each side contained three elements placed in a straight line. The array was mounted to a Rohn G-45 
tower section. The entire assembly was then mounted on the antenna rotator as shown in Figure 2. 


Figure 2 Experimental DF Antenna Array Mounted on Rotating Tower Structure 

TOWER TEST FACILITY 

Measurements were conducted at the outdoor antenna test range located at SwRI. This antenna 
range consists of an all-wood multi-tiered tower approximately 70 feet tall with a transmitter located 
on the ground approximately 1100 feet form the test antenna. The antenna arrays being tested can be 
rotated through 360 degrees of azimuth under remote control to measure antenna patterns (magnitude 
and phase). The measurement system has the capability of simultaneously measuring 12 RF channels 
relative to a reference channel. Relative amplitude and phase data are typically recorded at one-degree 
azimuth increments. The phase and amplitude reference is an auxiliary dipole antenna placed 
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approximately 20 feet below the test array. The RF channels are routed through a sequencer, which 
selects one channel for output into a dual-channel receiver. The other input comes from the reference 
antenna. 

ARRAY MEASUREMENTS 

The antenna array was tested in nine different configurations over a range of frequencies. The 
configurations consist of three radii and three element spacings. Each of the nine configurations were 
tested at 121.5, 157, and 243 MHz. Additionally, three configurations were tested over the range of 
120 to 240 MHz at 10MHz intervals. The measurements were taken by transmitting at the desired 
frequency and rotating the antenna array through 360 degrees in azimuth. 

NUMERICAL MODELED AND EXPERIMENTAL DATA 

An initial comparison of the numerical and experimental data was done graphically. Figure 3 
shows a typical magnitude response of a single element. The elements used for this application are 
vertical monopoles with nearly omni-directional patterns. In the magnitude response we see the 
effects of mutual coupling between elements and the structure. Magnitude responses generally agree 
well within 1 dB. 


NEC vs Experimental 



Figure 3 Typical Magnitude Comparison of 
Numerical Modeled versus Experimental Antenna Response 
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Figure 4 shows a typical phase response of a single element. In the phase response we see 
primarily the rotation of the element about the center point of the array and distortion effects only on 
as a second order effect. The phase responses generally matched well within 10 degrees. NEC 
provided a very good estimation of the experimental antenna response patterns. 


NEC vs Experimental 



Azimuth (degrees) 


Figure 4 Typical Phase Comparison of Numerical Modeled versus Experimental Antenna Response 


The experimental array performed very well as a DF array as predicted by the initial numerical 
model. For the nine configurations tested, the beamwidth was generally below 20 degrees and the 
maximum sidelobe level never exceeded -2.0 dB with respect to the main beam maximum response. 
For the DF systems used at SwRI, sidelobes within 0.5 dB of the mainlobe are generally acceptable. 
Not only did the experimental version of this array perform very well, it performed almost exactly as 
predicted by the numerical model. Figure 5 shows a comparison for two beamform response 
characteristics as functions of frequency. The top plot shows RMS beamwidth, the lower plot shows 
the maximum sidelobe level relative to the mainbeam. 
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Beamform Response Characteristics 




Figure 5 Comparison of Numerical Modeled and Experimental Beamform Characteristics 


The degree of match of the individual antenna patterns and the beamformer characteristics led 
to the question of how well the predicted array manifold matched the details of the experimental array 
manifold. To determine the match, the numerically modeled array manifold was use a the calibration 
manifold and compared to the experimental array manifold. A standard deviation of DF bearing error 
was calculated as a function of frequency. A mean correlation value between the modeled and 
experimental data sets was also calculated at matched azimuths as a function of frequency. Figure 6 
shows a sample for one array configuration. The bearing error using the NEC calibration remains 
below 1 degree. This indicates the feasibility of using NEC to generate calibration array manifolds. 
The mean correlation is greater than 0.98 for all configurations and frequencies and the overall mean 
correlation between NEC and the experimental data was 0.986. As a reference point the mean 
correlation of the repeat data sets was 0.994. The numerical model can not be expected to do better 
than the measured repeatibility. 
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DF Performance - NEC vs Experimental 
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Figure 6 Standard Deviation of Error and Figure-of-Fit between Numerical Modeled and 
Experimental Array Manifolds 


The exceptional comparison of the NEC data to the experimental is extremely encouraging. 
This agreement means that numerous investigations can be conducted using only the NEC model 
without the need for additional measurements. Based on the initial success of this project, several 
other related areas of interest will be examined. Among them are investigations of effects caused by 
very large towers, guy wires, and the existence of nearby scatters such as other antennas. Additional 
information about these effects and a full summary of the overall numerical model performance will 
be presented in the conference. 
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CONCLUSIONS 


NEC worked exceptionally well for designing a VHF wrap-around DF antenna array. A 
general array configuration was designed for optimal DF performance and constructed for 
experimental measurements. A comparison of individual response patterns, beamformer response 
pattern characteristics, and array manifold correlations indicated that the numerically modeled data 
matched the experimentally measured data very well. 

All project objectives were met or exceeded. The wrap-around array design met all design 
criteria. The numerically modeled results were not only accurate enough to design the array but also 
accurate enough to predict DF performance and be used as a calibration array manifold. SwRI now 
has the tools to predict the performance and create array manifolds for the wrap-around array when 
mounted on a large variety of towers and masts. 
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Specifying A Direction Finding Antenna With Examples 


T. R. Holzheimer 
Raytheon Systems Company 
Aircraft Integration Systems 
Greenville, Texas 75403 

Abstract: 

Direction Finding (DF) antennas are typically implemented 
in linear interferometers, resulting in very high angle of 
arrival (AOA) accuracy. Several DF antennas have been built 
of the 360 degree, circular variety. Three antennas are 
discussed that cover the frequency range from 0.5 to 18 GHz 
and exhibiting AOA accuracies of 0.5 to 1.0 degrees RMS. In 
optimizing these antennas, a set of DF system 
specifications have been determined that apply to all types 
of DF antenna systems. Specifications addressing these 
generic DF antenna system requirements can be defined to 
provide desired performance. 

Introduction: 

Numerous high accuracy DF antennas have been described in 
numerous papers and textbooks over the years.[1] A 0.5 
degrees RMS, high accuracy, 2 to 18 GHz 360 degree 
azimuthal DF antenna has been described and presented at 
Antenna Applications Symposiums in 1995 and a similar 
version covering 0.5 to 2 GHz and 1 to 18 GHz has been 
reported at the 1999 Symposium.[2] Most DF antenna arrays 
use low gain elements such as spirals and dipoles. Some 
interferometers use medium gain horns if the depth is 
available. They are typically implemented in linear 
interferometer format and suffer field of view limitations 
on the order of +/- 60 degrees in Azimuth. 

A full 360 degree field of view is typically provided with 
circular arrays. The earliest types used low gain omni¬ 
directional elements. Subsequent investigators looked at 
using directional antennas such as flared notches. A 
multitude of different beamformers have been investigated 
as well as antenna elments with varying types of 
polarization capabilities. Derivations and explanations of 
the synthesis used to produce AOA from a circular array 
have been previously reported and will not be repeated 
here. 
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The circular DF antennas evaluated are of the simultaneous 
modal phase type. Figure 1 illustrates the different phase 
modes that follow periodic functions of 360 degrees, i.e., 
NX 360 degrees, where N is the modal phase order. These 
phase modes are typically setup with beamformers such as, 
Butler matrix or Rotman Lens. Figure 2 shows the basic four 
element block diagram for the phase and amplitude only type 
of beamformer. Amplitude information can be obtained with 
the addition of an additional network or beamformer. 

Specifying DF Antenna Requirements: 

The DF antenna is difficult to separate from the DF antenna 
processor that applies the AOA algorithm. The DF antenna 
and processor are tightly coupled and are best specified 
together. The perfect array, with no coupling, can provide 
a full unambiguous field of view. Realistic situations and 
installations yield ambiguity rates and RMS AOA accuracies 
that are driven by element spacing and signal to noise 
ratio (SNR) as in linear interferometers. The required 
specifications are summarized in Table X. 

Table I. Required DF System Specifications. 

1. RMS AOA versus Frequency. 

2. Unambiguous Data Rate versus Frequency. 

3. Untagged Ambiguity Rate versus Frequency. 

4. RMS AOA Accuracy versus Signal to Noise Ratio. 

5. Unambiguous Data Rate versus Signal to Noise Ratio. 

6. Untagged Ambiguous Data Rate versus Signal to Noise 

Ratio. 

Table I provides terms that can be evaluated and used in 
determining AOA accuracy for a complete DF system. Good 
data is labeled as unambiguous data rate, while bad data is 
defined as untagged ambiguity rate. This says that based on 
the use of, typically, proprietary algorithms and 
calibration, data can be made more accurate and false AOA's 
can be minimized and identified. Data will illustrate this 
later. The addition of SNR provides the difficulty in 
separating the DF antenna from the DF processor which can 
include active amplifiers and receivers. 
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A generic, 2 to 18 GHz, linear interferometer will yield 1 
degree RMS AOA accuracy with less than 4% ambiguity rate 
and a greater than 90% unambiguous data rate all down to a 
SNR of 8 dB. A generic, 0.5 to 2 GHz, linear interferometer 
will yield 1 degree RMS AOA accuracy with less than 5% 
ambiguity rate and a greater than 90% unambiguous data rate 
all down to a SNR of 8 dB. In both cases the linear 
interferometer will have a field of view of +/- 60 degrees 
in azimuth and + /- 30 degrees in elevation. 

360 Degree DF antenna Examples: 

Two 360 degree DF antennas are pictured in figures 3 and 
4.[3] Figure 3 shows the 2 (1) to 18 GHz version while 
figure 4 shows the 0.5 to 2(18) GHz version. The numbers in 
parentheses show the full frequency range that the DF 
antenna is capable of operating over. In all cases a 
residual 6.1 degree (one sigma) phase pair tracking error 
was used as a typical post calibrated, wide-bandwidth, 
phase discriminating receiver. The first two references 
also compare to a 11.1 degree residual phase pair tracking 
error. A Raytheon Systems Company proprietary algorithm was 
invoked using a metric to determine the AOA accuracy. 

The 2 to 18 GHz version circular DF antenna exhibits RMS 
AOA accuracy of 0.6 degrees, a 95% average unambiguous data 
rate and a 3% untagged ambiguity rate. The roll-offs with 
drop in SNR to 8 dB are as expected. This DF antenna is 
also useable down to 1 GHz. Figure 5 shows the RMS AOA 
accuracy versus Frequency. 

The 0.5 to 2 GHz version exhibits RMS AOA accuracy 
averaging 0.71 degrees and maximum of 1 degree. Amplitude 
information is included in the ambiguity rate determination 
in order to meet the 5% desired. Figure 6 shows the RMS AOA 
accuracy versus frequency over the 1 to 2 GHz portion of 
its operational band. 

Implementation Issues: 

In all the configuration examples for the circular DF 
antenna, the beamformer used was a Butler matrix, but it 
could have also been a Rotman Lens. Low noise amplifiers, 
filters and switch's may be required in order to provide 
the DF antenna and processor intermodulation product 
immunity. Depending on the individual antenna elements 
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used, the DF antenna aperture can be relatively light 
weight, but the processor can be substantial when including 
the receivers. The desired diameter of the circular antenna 
is the smallest possible which is also true for the desired 
length of the linear interferometer. However, there is a 
tradeoff between size, AOA accuracy, un-blocked field of 
view and DF antenna placement. 

Summary: 

It is extremely difficult to separate the DF antenna from 
the DF processor. Developing overall DF antenna system 
requirements is the best way to pursue the DF antenna, 
keeping in mind the desired DF system specifications listed 
in table I. Aperture size must be carefully investigated in 
order to keep installation limitations minimized. The 
installation itself must be carefully evaluated as this can 
determine to what extent and how much calibration must be 
implemented in the DF system. As shown, the DF system 
specifications can apply to any DF antenna such as the 
linear interferometer or circular DF antenna. 
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Figure 3. 2 to 18 GHz, Patent Pending, 360 Degree Circular 

DF antenna. 








Figure 4. 0.5 to 2 GHz, Patent Pending, 360 Degree DF 

Antenna 
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A High Efficiency Broad Band Wire Antenna System 
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1. Introduction 

The performance of any antenna system is defined as the product of the antenna’s efficiency and 
the antenna’s directivity. For wire antennas, the directivity of the antenna is strictly a function of 
the antenna’s electrical length and the efficiency of the antenna is strictly a function of how well 
the input impedance of the antenna is matched to the characteristic impedance of the transceiver 
(usually 50 ohms). The input impedance of a wire antenna is also a function of the electrical 
length of the antenna. While a vertical monopole antenna over a ground plane exhibits optimal 
gain for the frequency at which the antenna is approximately a quarter wavelength, maximizing 
the efficiency of a wire antenna system over a broad frequency band requires more advanced 
methods. 

For traditional broadband wire antennas, inductor-capacitor circuit traps (referred to simply as 
“traps”) have been inserted into the wire antennas in an effort to utilize a single wire antenna 
element for non-contiguous multiple frequency transmission and reception. This approach 
usually entails that the existing wire antenna be designed for the lowest frequency (longest 
wavelength) of interest, and that the other frequencies of interest are sufficiently different such 
that the resonant bandwidths of the inserted traps do not interfere with one another. Over the 
years, many design variations have been implemented since the concept was first patented in 
1941 [1-3]. While many claims have been made regarding the number of resonant frequencies 
and the bandwidths associated with each of the resonant frequencies, all of the previous designs 
share one common characteristic limitation: no previous trapped wire antenna design provides 
continuous coverage across the complete frequency band of interest . In order for the traditional 
trapped wire antenna design to function properly, there must exist gaps in the frequency band, 
which are at least as large as the resonant bandwidths of the inductor-capacitor circuit traps. 

While these prior assemblies are adequate for some applications, they cannot meet the broadband 
efficiency, directivity, and tuning response time needs of newer system requirements, such as 
with frequency agile radios. 
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2. Theory of Operation 


The general purpose of the new antenna design is to provide a high efficiency, broadband, wire 
antenna system. The system is intended to provide continuous RF coverage while maintaining 
maximum antenna gain and stable input impedance. A significant aspect and feature of the new 
antenna design is the use of a plurality of wire antenna elements of different characteristics such 
that regions of stable and acceptable impedance characteristics can be utilized on each antenna 
element for part of the total frequency band of operation. Total band coverage is achieved by 
combining the stable and acceptable impedance characteristics of each segment such that these 
segments overlap in frequency coverage enough to provide the desired frequency response across 
the entire band of operation. 

Another significant aspect and feature of the new antenna design is the placement of RF traps on 
the antenna elements such that the antenna pattern is controlled for that segment of the antenna 
element’s frequency region of operation. Significant lobing of the pattern is prevented by the RF 
traps acting as band-stop filters limiting the current distributions to their fundamental operating 
regions. Total band coverage is achieved with a plurality of antenna elements and a 
corresponding plurality of RF traps that control the operation of different segments to provide 
total frequency coverage across the band of operation. 

An example of one such a high efficiency, broadband, trapped wire antenna system, consisting of 
two wire antennas and four inductive-capacitive traps, is shown in Figure 1. The inductive- 
capacitive traps serve to limit the RF current flowing along the wires at various frequencies 
allowing the antenna system to appear to have physical lengths that correspond directly to the 
wavelength of interest. The bold arrows in Figure 1 represent these currents. In essence, the 
two-wire antenna system is broken up into six sub-antenna systems, each of which is capable of 
simultaneous transmission or reception. The two-wire trapped antenna network is then 
connected to the transmitter(s) via a series of RF filters and matching networks. 

The connection network is used both to direct the radiated signal to the correct sub-band antenna 
and to impedance match, if required, the resultant sub-band antennas to the transceiver. Hence, 
each resulting antenna sub-band has a RF filter and impedance matching network associated with 
it to passively select the proper antenna and ensure maximum efficiency. Maximum antenna 
efficiency, at any given frequency, is obtained by matching the input impedance of the antenna 
to the characteristic impedance of the transmitter. The technique of adjusting the electrical 
length of the wire antennas with the inductive-capacitive traps also serves as a passive first order 
method of stabilizing the antenna’s input impedance. 

The trapped antenna network is designed such that the longest wire element corresponds to the 
first resonant length of the lowest design frequency (largest wavelength). For the monopole 
case, this length is approximately a quarter wavelength and for the dipole class, this length 
approximately a half wavelength. At this frequency (the lowest design frequency), none of the 
inductive-capacitive traps are active and the set of band-pass filters directs the signal to the 
proper antenna. Because the antenna is resonant at this particular frequency, the matching 
network does little to no work in ensuring that the antenna is properly impedance matched to the 
transceiver. 
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Figure 1. Banded Current Distributions 

As the frequency is increased, the longest wire element is no longer resonant and the matching 
network begins to play a more active role in ensuring that the input impedance of the antenna is 
properly matched to the transceiver. This process continues until the longest wire element is too 
far from resonance to function at the pre-set efficiency level. In this region, the filter set 
transitions the signal from the existing wire element to the adjacent (second longest) wire 
element whose length has been designed to the resonant length of the switch over frequency. All 
of the inductive-capacitive traps are still not active and all that has occurred is that the signal is 
transitioned to a different radiating structure that is more efficient. 

As the frequency is increased further, the matching network associated with the second longest 
antenna sub-element begins to match the impedance of the element to that of the transceiver. 
Again, this process continues until the radiation efficiency of the second antenna element 
transitions below the efficiency level of the next element. At this point, the band-pass filter 
transitions the signal back over to the first antenna wire. 
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This time, the upper most inductive-capacitive trap is active, stopping any RF current flow to the 
upper portion of the wire. The location of the upper most inductive-capacitive trap is set such 
that the clipped wire element is now once again resonant at the new higher frequency. 

As the frequency is increased further, the matching network associated with the third sub¬ 
element adjusts to ensure that the impedances of the antenna and transceiver are properly 
matched. Once again, this process continues until the third longest wire element is too far from 
resonance to efficiently function. The signal is then once again transitioned over by the band¬ 
pass filter and antenna element impedance to the other antenna wire. 

At this new higher frequency, the upper most inductive-capacitive trap on the shorter antenna 
wire is now active, stopping any RF current flow to the upper portion of the shorter antenna wire. 
Once again, the clipped wire element is resonant at this higher frequency. This entire process is 
continued until all of the inductive-capacitive traps existing on both wire elements have been 
utilized. 

When the frequency is raised to a point that the shortest antenna element no longer radiates 
above the pre-set efficiency level, the antenna system has reached its upper frequency limit. 
While this process has been described for a two-wire antenna network, the design may be 
extended to include any number of wire elements containing any number of inductive-capacitive 
traps. 


3. Prototype HF Antenna System 

A prototype antenna was design and built which utilized two long wire antennas and four traps to 
achieve coverage over the entire HF band (2 to 30 MHz). The new dual-wire trapped-antenna is 
designed to replace an existing antenna system consisting of two equal length wires. The dual¬ 
wire trapped-antenna consists of two wire antennas of different lengths. The first wire had the 
same length as the existing antenna and the other wire length was approximately half the length 
of the existing antenna. Each wire antenna had two traps installed along it. A 11.5 MHz trap 
and a 20 MHz trap were installed on the longer wire with the 20 MHz located nearer to the 
antenna feed. A 14.7 MHz trap and a 28.7 MHz trap were installed on the shorter wire with the 
28.7 MHz trap nearer to the antenna feed. 

The dual wire trapped antenna system was designed and analyzed using computer simulation 
techniques. The trapped wire antennas were modeled using NEC-4 [4] running on Silicon 
Graphics and Sun Workstations. The analysis was performed on both the long and short wire 
antennas for frequencies from 2 to 30 MHz in 1 MHz increments. The computed real input 
impedance for both the trapped wires is shown in Figure 2. 

Prototype antennas where then constructed to demonstrate the modeled antenna results. The 
antenna test platform utilized was of the same size and shape as that of the upper half of the 
aircraft that the existing antenna system was originally designed to operate with. The aircraft test 
platform was chosen in order to demonstrate the performance improvement of the dual-wire 
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trapped-antenna over the existing antenna configuration. The test platform was built out of wood 
and covered with screen. The actual feeds that are used on the aircraft were incorporated into the 
test platform. The antenna impedance was measured for each test using an HP Vector 
Impedance Meter from 2 to 30 MHz in 0.1 MHz steps. 



0 5 10 15 20 25 30 

Frequency (MHz) 

Figure 2. Modeled Dual Wire Trapped Antenna - Real Impedance 


The measured data shows that the prototype dual wire trapped antenna performed as predicted. 
The measured results correlate fairly closely with the modeled results. The comparisons between 
the measured and modeled results are shown in Figure 3 for the trapped long wire antenna and 
Figure 4 for the trapped short wire antenna. The minor differences between the modeled and 
measured results are mainly due to the fact the traps were not resonant at their actual design 
frequency. This was primarily due to the capacitors that were used in the traps. The capacitors 
were commercial capacitors that were accurate to within ±20%. In addition, the tuning of the 
inductive coils was performed by manually shorting out the turns of the coil. This is a coarse 
method of tuning and can lead to small variations in the inductor values. These differences can 
be eliminated in the future by the use of variable capacitors in the traps. 
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Real Impedance 



Figure 3 

Measured vs. Modeled Real Impedance Data for the Trapped Long Wire Antenna 



Frequency (MHz) 


Figure 4 

Measured vs. Modeled Real Impedance Data for the Trapped Short Wire Antenna 











4. Impedance Matching Advantages 


As can be seen from Figure 2, the real input impedance variations of the long and short wires are 
completely out of phase with one another; hence, when one is unstable, the other is stable. This 
offset in impedance is a direct result of the four traps (two on each wire) which were installed 
into the system. Each wire has two resonance points. These resonance points are offset so that 
when one of the wires has a high resonance point, the other antenna wire has an acceptable 
impedance value. Therefore, the system can switch back and forth between the two wires and 
maintain reasonable input impedance across the entire band of operation. The frequency bands 
can be divided as follows: 

Frequency (MHz) Antenna Wire 

2 to 3.5 Long 

3.5 to 6.5 Short 

6.5 to 10.5 Long 

10.5 to 19.5 Short 

19.5 to 26.5 Long 

26.5 to 30 Short 

By utilizing these frequency bands, the new trapped long wire antenna will be significantly 
easier and quicker to impedance match. Also, the imaginary impedance is monotonic across 
each of the frequency bands. This will allow the matching circuit to be very simple, requiring 
only one variable reactive component. Figure 5 shows the real input impedance of the dual wire 
trapped antenna as each antenna wire is used according to the table above. 



Figure 5. Dual Wire Trapped Antenna Real Input Impedance 
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5. Conclusion 


Although many trapped antenna designs exist in both the amateur radio and military 
communication realms [1-3], all of these antenna designs are multi-band systems, which are 
designed to work at a finite set of predetermined frequencies. By utilizing multiple wire 
elements and controlling their current distributions utilizing RF traps, a truly broadband antenna 
has been developed rather than a multi-band antenna which is commonly associated with RF trap 
technology. 

The dual-wire trapped-antenna design offsets the resonance points of the two wires so that they 
do not coincident in frequency. By shifting the resonances, the antennas can now be broken into 
frequency bands that have stable real impedance values and have monotonic imaginary 
impedance values. This improved input impedance performance allows for much easier 
impedance matching. The required matching network will be much simpler than the existing 
networks needed and will be able match any frequency very quickly. 

Finally, the dual-wire trapped-antenna system concept was proven by the use of computer 
modeling. The prototype antenna design was based upon numerical optimization of the 
computer models. The measured results of the prototype antenna closely matched the predicted 
results obtained from the computer modeling. The dual wire trapped antenna system is 
applicable to any wire type antenna system having elements configured as monopoles, dipoles, 
or slots. ARINC currently holds the patent for the new high efficiency broad band wire antenna 
system (Patent Number: 5767812). 
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ABSTRACT 

A circular array is modeled using two independent moment method codes. The array utilizes closely 
spaced high-gain end-fire elements to maximize the volumetric efficiency of an ellipsoidal enclosure, 
while minimizing the number of elements requiring control. An initial element was designed with an 
optimizer based upon the NEC4 engine; however, the use of moderately thick wires, wire-wire 
spacings of approximately one diameter, and non-trivial feeds required a wire/plate code for accurate 
element performance prediction. At the same time, efficient modeling of the entire array required a 
second code with a discrete body-of-revolution (DBOR) capability. Both electromagnetic codes were 
used in conjunction to arrive at a complete array design. With the exception of the highest frequency, 
where balun difficulties were plainly evident, correlation between predicted and measured results are 
very good. 

1. INTRODUCTION 

Utilization of multiple electromagnetic codes is sometimes necessary for modeling antennas. 
Frequently, the capabilities and strengths of a particular code are well suited to the geometry of a given 
antenna, whereas another code is appropriate for modeling other additional features. At times, one must 
incorporate certain geometrical features into the antenna design to meet desired performance criteria, 
criteria that may not be well represented by any one piece of modeling software. This paper describes 
such an array, where the element design was completed with a wire/plate code called WIPL-D [1], and 
the array modeling was performed with NEC4. The array we are referring to is an experimental 
prototype for a UHF Electronically Scanned Array (UESA), which is proposed as a replacement for the 
current mechanically rotated radar array which is mounted atop the Navy's E-2C. A surveillance radar 
realizes numerous benefits when instantaneous 360° beam agility is available; however, support for 
such a retrofit is difficult to come by unless the replacement antenna provides equal or superior 
performance. This is a difficult task not because of the inherent geometry of circular arrays, but rather 
because the end-fire architecture results in considerable unused space in the center of the ellipsoidal 
radome. The radome center is also where the greatest vertical separation exists. Monopole arrays 
provide a simple way of utilizing the space in the center of the radome, but result in vertical 
polarization, which is undesirable for low frequency over sea radars. The array described herein is the 
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result of an aggressive effort tty to obtain the maximum gain possible, while providing sidelobes in the 
30-40 dB range. 

2. ANTENNA ELEMENT DESIGN & MODELING RESULTS 


The Yagi-Uda was selected as the array element because of its high gain, simple feed arrangement, and 
transparency to orthogonally polarized antennas that may be located behind it. In order to handle very 
high power and prevent corona discharge from wire ends, % inch diameter tubing was selected. From 
previous circular array studies [2], it was determined that 50-60 elements were required to obtain the 
desired sidelobes while maintaining the desired azimuth plane beamwidth. 


A challenge is imposed by the need for this many elements. Assuming a Yagi reflector length of 
approximately X! 2, elements operating at the UHF radar band are limited to about two feet in length 
(measured along the boom) for a 24-foot radome. Such an arrangement would lead to poor volumetric 
utilization. The Yagis can be lengthened by applying some type of shortening scheme to the rearmost 
elements, such as the disk on a tophat monopole. We elected to bend the wires on the reflector, driven 
wire (we avoid the term driven element to avoid confusing this with an array element ), and the first 
two directors. This method enabled the Yagi to be lengthened by a factor of nearly two. Various other 
types of elements, such as the cubical quad, were tried with little success. The quad appears to be an 
attractive choice for the circular array configuration because the reflector (from a top-down 
perspective) is only about X! 4 in length, allowing a much longer element. One finds quickly, though, 
that the strong mutual coupling between the vertical wires of adjacent elements causes the active 
element VSWR to deteriorate rapidly as the quad is lengthened. 


The first element was optimized with the NECOPT optimizer [3]. Initially, the shortening stubs were 
arranged in a symmetric manner, as shown in Fig. 1(a). It was then discovered that with 60 elements, 
coupling of the adjacent elements was severe enough to result in array (reflection) efficiencies of 
around 60%. Considerable experimentation resulted in the element shown in Fig. 1(b). In this 
configuration, coupling between adjacent pairs of shortening stubs is considerably reduced. By 
introducing the odd-angled shortening stubs and reducing the number of elements to 54, array 
reflection efficiencies of approximately 90-95% were observed in the NEC4 model. The array can be 
modeled efficiently with NEC using the GR card (circular symmetry)—a 54 element array with 13,000 
segments can be run in about 9 minutes on a 300 MHz PC. 
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A single full-size element was built and VSWR tested on an HP 85IOC network analyzer. Results were 
disappointing, as the VSWR at the upper portion of the band was poor. Several characteristics of the 
geometry lead to non-uniform currents (circumferentially) on the surface of the wires, a violation of the 
fundamental assumption used in the NEC formulation. The use of thick wires in conjunction with 
closely spaced wires (in particular, the spacing between the driven wire and first director is only about 
one wire diameter) leads to non-uniform currents. The close spacing is required to obtain the desired 
bandwidth. The comers also induce non-uniform currents, but we showed in a previous paper [4] that 
NEC appears to handle thick wires with 90° comers nearly perfectly. The last characteristic responsible 
for non-uniform currents is the feed. The final arrangement included a parallel plate feed that attaches 
to one side of the driven wire. The NEC E-field voltage source model assumes that the electrical field 
is completely uniform in the gap region. The parallel plate feed results in a decreased resonant 
frequency. Simply shortening the driven wire is not an effective means of compensation, since the 
shortening scheme removes the actual alignment of the stubs at the end and their associated mutual 
coupling. 

In order to accurately predict the element performance, a new model was created using the WIPL-D 
package. WIPL-D uses a combination of plates and wires in an EFEE approach. Instead of using wires 
like NEC, the rearmost wires were "built" from plates as eight-sided hollow tubes. To reduce the total 
number of unknowns, the five wires in the front (directors 3-7) wires were modeled as uniform tubes of 
current as would be done in NEC. These wires, in addition to having no feedpoints, have no 90° bends 
and are displaced far enough from each other that the current can be assumed to be uniformly 
distributed along the wire circumference. 

The WIPL-D model revealed instantly why the VSWR at the upper band edge was poor. From Fig. 
2(a), we see that the radiation resistance drops rapidly above 440 MHz. The WIPL-D model was used 
to make adjustments such that the radiation resistance was flattened. In addition, it increased enough to 
work directly with a 50 Q characteristic impedance. The resulting element is shown in Fig. 3. 



Figure 2—(a) Impedance of element of Figs. 1(b) and 3. (b) VSWR (50 Q ) of isolated and embedded element (of Fig. 3) 
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Figure 3—Top-down view of final wire/plate hybrid WIPL-D model. 


The gain of both element models was corrected by integrating the radiation pattern over all space. 
Interestingly, the NEC model required significant corrections at all frequencies. This is attributed at 
least in part to the optimization process. Since the optimizer seeks to minimize an error function that is 
based upon the code outputs, the optimizer will attempt to find the "Achilles heel" of the code. 
Optimistic results are expected from such an automated process, subject to the accuracy of the code 
and the particular model in question. 


Table I shows the computed gain of the element on its boresight, correction factors, and corrected 
boresight gain for both the NEC and WIPL models. Once corrected, both models agree at all points 
within the band to an error of less than 0.1 dB. 


TABLE I 

UNCORRECTED AND CORRECTED BORESIGHT DIRECTIVITIES FOR NEC AND WIPL MODELS 


ESIffiEBB 

Computed Dir. (dBi) 

Correction (dB) 

Corrected Dir. (dBi) 


NEC 

WIPL 

NEC 

WIPL 

NEC 

WIPL 

400 

13.06 

12.49 

-0.48 

0.08 

12.58 

12.57 

425 

13.70 

13.02 

mzm 

0.08 

13.17 

13.10 

450 

14.30 

13.48 


0.09 

13.63 

13.57 


Since WIPL does not yet have the rotational symmetry capability, all array modeling was performed 
with NEC. While differences in the isolated element VSWR are observable in Fig. 2(b), the 
differences are not overwhelming. Radiation patterns from the two codes are in good agreement. In 
Figs. 4 and 5 are the isolated and embedded element patterns. 
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Figure A —Principal plane cuts of (a) azimuth, and (b) elevation patterns of isolated element at 400 (...)> 425 (- -), and 450 
(-) MHz. Peak value of the graph is 13.6 dBi. 



(a) (b) 

Figure 5—Principal plane cuts of (a) azimuth, and (b) elevation patterns of embedded element at 400 (...), 425 (- -), and 
450 (-) MHz. Peak value of the graph is 9.7 dBi. 


3. MEASUREMENT RESULTS 






A complete l A scale model of the array was fabricated at Penn State University and shipped to 
NAWCAD Patuxent River for testing. Measurements included azimuth and elevation cuts for isolated 
elements, as well as azimuth cuts for embedded element patterns. Table II provides a comparison 
between the modeled and measured beamwidths and gains of the isolated element. Table HI 
summarizes the available gains and beamwidths, synthesized from measured embedded element 
patterns. Results are include for a 27-element uniform excitation, and 27/54 element low sidelobe 
excitation resulting from an iteratively weighted least squares (LS) method, as defined in [5]. 

It should be noted that the balun was not included in the electromagnetic models. The best performance 
of a prototype element was observed with a split-wire balun, attached directly at the feedpoint. 
However, because of the difficulty in fabrication, its fragile nature (at the scale model size), and time 
constraints, the split-wire version was replaced with more easily fabricated remote bazooka balun. 


TABLED 

COMPARISON OF NEC MODELED AND MEASURED RESULTS (ISOLATED ELEMENT) 



NEC 

Measured 

Freq 

Azimuth 

Elevation 

Gain 

Azimuth 

Elevation 

Gain 

(MHz) 

BW(°) 

BW (°) 

(dBi) 

BW ( 6 ) 

BW ( 6 ) 

(dBi) 

400 

48.9 

41.0 

12.6 

48.0 

41.3 

12.5 

425 

45.6 

38.6 

13.1 

44.2 

39.4 

13.1 


41.4 

36.4 

13.6 

43.4 

38.8 

12.6 


TABLE DI 

SUMMARY OF MEASURED (SYNTHESIZED) ARRAY PATTERNS 


Freq 

(MHz) 

Excitation 

Gain 

(dBi) 

Az BW 
(°) 

Peak Sidelobe 
(dB rel Max) 


Uniform 

EB2 

MEM 

11 


27 LS 


8.2 

26 


54 LS 


7.8 

35 


54 LS 


8.2 

40 


Uniform 

EE! 

6.7 

10 


27 LS 

Egg 

8.1 

30 


54 LS 

EEI 

7.7 

35 


54 LS 

EE 

8.2 

40 

ES9 

Uniform 

EMEU 

7.0 

9 


27 LS 

Egg 

8.1 

24 


54 LS 1 


7.9 

35 


54 LS i 


NA 

40 
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Figure 6—1/4 scale model array. 

Azimuth cuts of the embedded elements are shown below in Fig. 7. Isolated element measurements are 
not shown, for lack of space. 



(a) (b) 



(c) (d) 


Figure 7—Computed vs, measured azimuth patterns of embedded element at (a) 400 MHz, (b) 425 MHz, (c) 450 MHz. (d) 
Low sidelobe pattern synthesized from 425 MHz (1.7 GHz) measured data with all elements active. 
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Figure 8—Samples of (a) isolated and (b) embedded element VSWR at the scale frequency. 

4. CONCLUSIONS 

A complete 54-element circular array was designed using both the WIPL-D and NEC4 codes. WIPL-D 
proved to be critical for obtaining an acceptable element design, while NEC's ability to exploit the 
inherent rotational symmetry of the problem was essential for modeling the entire array. Feedline 
radiation at 450 MHz, resulting from the partially ineffective bazooka balun, caused the single 
exception to a superbly correlated set of modeling and measurement results. Measured azimuth and 
elevation plane beamwidths of the isolated element were within about 1° of that modeled at 400/425 
MHz. The measured boresight gains at these frequencies were within 0.1 dB of that observed from the 
models. Excellent correlation of measured and modeled embedded element patterns is also evident in 
Figs. 7(a) and 7(b). 
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ABSTRACT 

Small (sub-wavelength) position offsets of antenna array elements is a known technique for reducing array 
sidelobes, particularly when grating sidelobes are present in the array beam pattern. However this method is 
of limited benefit as the individual antenna elements are a significant fraction of a wavelength in size and so 
the available space to adjust their positions is too small to be effective. Mutual coupling also proves a 
constraint. Fractal antenna elements, which are resonant, low loss, of moderate bandwidth, and a smaller 
fraction of a wavelength in size can be used successfully to exploit this method of array sidelobe reduction. 
With fractal elements, more “real estate” of the phased array is available to accommodate position 
adjustments that reduce array sidelobes. We show an example of such a system as a 4 x 4 array of 
Minkowski-2 antenna elements, spaced at nominal separations of X and have sizes of X/S, with offsets of 
order X/S from a uniform distribution. Sidelobe reductions of 5-10 dB relative to an array of quarter- 
wavelength dipoles are shown in NEC4 simulations. The method proves amenable to use in conjunction with 
tapering. 


1. Introduction 

Sidelobe magnitude is an important design consideration for many phased array systems. Reducing the size 
of sidelobes reduces the power from interfering sources in phased arrays used for receivers with 
corresponding gains in system performance. Complementary advantages are obtained for phased arrays used 
for transmission. Development of a new or modified technique for sidelobe reduction in phased array design 
has implications for a wide range of practical problems. 
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A phased array (used as a receiver) may be viewed as measuring a sampling of the electromagnetic field 
impinging on it. In the Fourier domain, the phased array has response over spatial frequencies corresponding 
to the inverses of all the relative separations of the array elements. Since there are only a limited number of 
elements and a finite set of relative spacings, sampling in the Fourier domain is necessarily incomplete, 
which results in sidelobes in the array beam pattern. When the sampling in the Fourier domain is so 
incomplete as to violate Nyquist’s theorem, i.e. with element spacings greater than A/2, grating sidelobes 
appear which are the result of aliasing in the spatial Fourier domain. Additionally, the use of uniform arrays 
produces periodic sampling of selected spatial frequencies, enhancing periodic sidelobes. 

These general theoretical considerations motivate strategies for improving phased array performance by 
arranging the antenna elements so as to sample more efficiently the spatial Fourier domain, while keeping the 
number of elements small. Some examples of such approaches include randomly distributed elements and/or 
fractally distributed elements in phased arrays (Kim and Jaggard, 1986). Generally, these methods seek to 
subtract elements from very large arrays to achieve reasonable performance. 

In this paper we consider designs close to the canonical phased array designs, and then modify these designs 
by offsetting array elements so as to improve array performance by improving sampling in the spatial Fourier 
domain. Usually, this would not be considered as an attractive approach because the antenna elements in a 
typical phased array, e.g. a half-wavelength dipole, cannot be offset by any significant amount without 
running into other array elements, or yielding problems from mutual coupling. However, if we construct a 
phased array with fractal antenna elements, the antenna elements can be smaller by a factor of two or three, 
and we then have sufficient space to explore the effects of antenna offsets on the array beam pattern. We 
will consider examples in one and two dimensions with the array fractal elements. Simulation results from 
NEC4 will be presented that illustrate the performance of such an array. 


2. Mathematical Discussion 

We develop here the mathematical formalism describing sidelobe reduction by sub-wavelength offsets of the 
positions of antennas in the phased array. The formalism is based on the description of a uniformly spaced 
one-dimensional array as described in Balanis (1997) and in many other references. The array factor of an N 
element array of elements uniformly spaced with separation d is: 

AF - ^ exp \j(n -1 )(kd cos 9 + p )\, (1) 

n=I 


where k = Inf A is the wavenumber of the electromagnetic wave, 6 is the azimuth angle, and p is the beam 
steering angle. We want to extend this formalism by putting in d ai the position of the nth antenna in 
explicitly. For the case of linear spacing, d n - d{n - 1), and if an offset or “dither” of 4 in the position of the 
nth element is allowed, 

d ri =d(n-\) + S n . (2) 

and so if we incorporate the dithered positions in the expression for the array factor, we obtain 
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AF = f>p[/*((«-l)</ + <?„)+/(» -1)/?] 

«=1 

N 

~ ^ exp[/(« - 1)(M cos 6 + /?)] expt/M,, cos 0 \. 


This expression allows us to understand the mechanism by which introducing spatial offsets in antenna 
position can lead to a reduction in the size of sidelobes. As can be seen in equation (1), the array factor is a 
sum of exponential factors, i.e. phasors, and so at any given angle and beam steer, the beam power depends 
on whether this phasor sum adds constructively or destructively. Introducing a spatial offset of the antenna 
element positions, multiplies these phasor terms by another complex exponential which changes their phase, 
and hence affects whether the phasor sum is adding constructively or destructively at any particular azimuth 
angle and beam steer angle. Different choices for the <%s will enhance or diminish different features in the 
beam pattern. Appropriate optimization schemes can be constructed for choosing the in different phased 
array design problems, but development of that subject is beyond the scope of the present paper. 

Equation (3) lets us illustrate this point with a simple 1-D example. Consider a baseline array with N= 8 and 
d = M2, with uniform excitation at each element. We calculate the beam pattern of that array at a zero beam 
steer angle and show the polar diagram of the beam pattern for reference in Figure 1A. To illustrate the 
effects of introducing the offsets in the element positions we arbitrarily chose, with no effort to select an 
optimal solution, a vector £of offsets (in 1-D) for the element positions given by 

kS = (0.0,0.25,0.5,0.75, - 0.25, - 0.5, - 0.75,0.0). (4) 

The resulting beam pattern is given in Figure IB. In both beam patterns, 40 dB of dynamic range of the 
beam is plotted, normalized relative to the amplitude of the main beam at 90°. The main beam in both 
cases has an amplitude of 18.03 dB absolute. In the baseline case the first sidelobe has an amplitude of 


Amy 8«»m P*tt*m 



DiCiered Array B*am Pattern 



Figure 1: Beam pattern at left is for an 8 element linear array of uniformly excited elements spaced at 
intervals of M2. At right is the beam pattern resulting when the positions of this array are adjusted by the 
offset vector given in equation (4). The amplitude of the first sidelobe has been reduced by 3.6 dB. A 
substantial gain in performance has been obtained, although the choice of offsets has not been optimized. 
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5.26 dB, te. 12.76 dB down from the main beam, while in the case with the positions adjusted, the first 
sidelobe is at 1.67 dB, te. 16.36 dB down from the main beam. The offsets have improved the performance 
of the array in terms of the first sidelobe amplitude by 3.6 dB. In the case with offsets, the second sidelobe 
appears at a higher amplitude than the first sidelobe. 

It is clear that adjusting the antenna positions has significantly changed the sidelobe structure in the array 
beam pattern, including the presence of a new sidelobe at end-fire. The offsets chosen here are merely 
illustrative, rather than intended to optimize array performance overall. Other choices of offsets can be made 
to optimize particular properties of the array beam pattern as desired. 


3. Description of the Fractal Antenna Element and Array Motivation 

Fractal elements incorporate self-similarity in order to change the operational characteristics of the antenna 
(Cohen, 1995). The elements under consideration here consist of fractal loops with a rectangular motif, 
dubbed Minkowski (Koch) fractal island loops (Cohen, 1995). These, among others, have been identified to 
have shrunken sizes (about A/8 wave on a side) while maintaining very high efficiency. These loops, in 
iteration 2, have impedances of order 30 ohms real, and two parallel current maxima. This effectively gives 
this loop higher field strength than expected from the radiation resistance given by the small loop 
approximation (Cohen and Hohlfeld, 1996) 

These ‘MI2’ loops were chosen because their small size when resonant allows greater versatility in the 
spatial offsets to be used. Furthermore, unlike other small elements, such as A/8 dipoles, MI2 is self-resonant 
and requires no additional loading components, which produce higher losses. Finally, the useable bandwidth 
of the MI2 far exceeds that of other small loaded antennas of the same size, due to the higher radiation 
resistance caused by the two current maxima. 

It has been known for some time (Kim and Jaggard, 1986) that random, fractal distributions of elements on 
an array lead to sidelobe reduction. However, such reduction requires rather large offsets from a uniform 
distribution. Thus we were motivated here to show that fractal elements, through their smaller size, self 
resonance, and greater bandwidth, offer a practical realization of sidelobe reduction in arrays of small form 
factor, and/or arrays which have limited ‘real estate’ for small offsets. 

4. Phased Array Simulation 

Aided by the one-dimensional analytical simulations, we chose to simulate the two dimensional array of 
fractal elements using the method of moments engine NEC4. Accuracy of mutual coupling and ohmic losses 
further guided us to choose this route. 

Each MI2 element consisted of 100 wires sampled by 2 to 3 segments per wire. We chose an array of 4 x 4 
elements separated, without offsets, by A at 450 MHz in vertical and horizontal dimensions. We had the 
option, and used, voltage weighting of each element in the array to produce tapering with 0.5-1-1-0.5 
tapering per column. Furthermore, we employed constant phase offsets on elements 2 and 3 of each array 
column, to produce consistent sidelobe reduction across a variety of steering angles. 

We offset or ‘dithered’ the spatial position of the elements to produce the sidelobe reduction suggested by the 
one-dimensional case. These ditherings were not designed for optimization but for illustrating the effect. 
We considered a maximum horizontal dither of about and tried several cases to determine typical 
sidelobe reductions. 
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As a comparison, as above, we used uniformly spaced, loaded 2/4 dipole elements to simulate a typical array 
of patch-sized elements. This array was tapered as above for each column and was phase steered uniformly 
per column. 

To save computational time, neither array was backed by a reflector, and thus the results yielded bi¬ 
directional beam patterns in both cases. 


5. Results and Discussion 

Figure 2 shows the uniform dipole array, with accompanying azimuthal power patterns in Figures 3 and 4 for 
zero and 30 degrees steer, respectively. Similarly, the fractal element array and its concurrent results are 
found in Figures 5 through 7. 

The results indicate dramatic sidelobe reductions for the fractal element array. In particular, at 30 degrees off 
boresite, the dipole array suffers from a grating sidelobe, which is reduced by 9.4 dB in the fractal element 
array. Other sidelobes experience reductions of 3-5 dB. 

We note that the improvements for the MI2 panel array simulations are even larger than we obtained for the 
1-D case considered in section 2. In the 1-D case we are modifying the structure of ordinary sidelobes in an 
array with 2/2 spacing, while in the MI2 panel simulations the original spacings were by 2. and the 
amplitudes of the grating sidelobes was reduced. 


6. Conclusions 

Using fractal elements offers an opportunity to reduce sidelobes through a dithered offset, of order 2/8. Here 
we have demonstrated that this proves a practical means of achieving such reductions, without large offsets 
in the array. The method further benefits from more conventional methods of sidelobe reductions, such as 
tapering, and can be used in conjunction with it. 
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Figure 2: Baseline dipole array for NEC4 simulations. A 4 x 4 array of uniformly spaced loaded dipoles 
with taper as indicated in the text. 



Figure 3: Zero beam steer beam pattern for the baseline dipole array shown in Figure 2. 



Figure 4: Beam pattern for 30° beam steer of the baseline dipole array. Note the presence of a large grating 
sidelobe. 


827 
















■12 panel array dlth 1 


U-Z9-W9 lb.ZZ'.i? 



Figure 5: Array of MI2 dipoles with dithered positions in the xz-plane. 



Figure 6: Beam pattern at zero beam steer for the dithered MI2 panel array. 



Figure 7: Beam pattern at 30° beam steer for the dithered MI2 panel array. Note that the amplitude of the 
grating sidelobe is significantly reduced. 
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Abstract- There has been considerable recent interest in the radiation characteristics of self- 
scalable and self-similar planar arrays. For instance, array configurations based on Sierpinski 
carpets have been considered in [1,2]. The properties of self-scalable concentric circular 
Weierstrass arrays and self-similar concentric circular Cantor arrays have also been investigated 
in [3] and [4] respectively. In this paper we present an alternative design methodology for the 
mathematical construction of self-scalable and self-similar planar arrays. The technique is very 
general and consequently provides much more flexibility in the design of fractal and related 
arrays when compared to other approaches previously considered in the literature. This is 
primarily due to the fact that the generator in this case is based on a concentric circular ring array. 

1. Introduction 

A rich class of fractal and related arrays exist which can be formed recursively through the 
repetitive application of a generating array. A generating array is a small array at scale one ( P=1 ) 
used to recursively construct larger arrays at higher scales (i.e., P>1). In many cases the 
generating array has elements that are turned on and off in a certain pattern. A set formula for 
copying, scaling, and translating of the generating array is then followed in order to produce a 
family of higher order arrays. Hence, arrays which are created in this manner will be composed 
of a sequence of self-scalable or, in some cases, self-similar subarrays. 

The array factor for such self-scalable or self-similar arrays may be expressed in the general 
form [1,2] 

AF f (y/) = PI GA{8 p ^tf/) (1) 

P =1 

where GA(y/) represents the array factor associated with the generating array. The parameter 8 
is a scaling or expansion factor which governs how large the array grows with each recursive 
application of the generating array. We may regard (1) as representing a formal statement of the 
pattern multiplication theorem for self-scalable and self-similar (i.e., fractal) arrays. 

2. The Concentric Circular Ring Array Generator 

The generating array factor for the concentric circular ring array may be expressed in the 


form [5] 

M N m 

= (2) 

m=] n~\ 

where 

Vm(0’<f>) = Ks™0cos(<fi-i/> mn )+a mn (3) 

k~2n!X (4) 

M = Total number of concentric rings (5) 

N m = Total number of elements on the wth ring (6) 

r m = Radius of the wth ring (7) 

I mn = Excitation current amplitude of the nth element on the mth ring (8) 
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a mn — Excitation current phase of the wth element on the mth ring (9) 

The array factor for a particular stage of growth P may be derived by substituting (2) directly into 
(1). The resulting expression for the array factor is then 

P f M N m ) 

AF f {e,i)=f[P£Y 4 l m ,e JS ''''"™ (10) 

p =1 [«=1>J=1 J 

3. Examples 

Several different examples of arrays that may be constructed via a concentric circular ring 
subarray generator will be presented and discussed in this section. The mathematical expressions 
that describe the radiation patterns of these arrays are all special cases of (10). 


3.1 Planar Square Arrays 

The radiation characteristics of Sierpinski carpet arrays have been considered in [1,2]. The 
generating subarray for a Sierpinski carpet may be represented by two concentric four-element 
circular arrays. Hence, the normalized Sierpinski carpet array factor may be expressed in the form 

AF p (9,t) = ±flii eJ3P '' M (H) 

O p=\ m =1 ns 1 

where 

¥ m (6,<!>) = Vw X [Sin0cos(<*Ssin 6 0 cos(<z> 0 (12) 


The geometry and the array factor for the first four stages (i.e., P =1 - 4) of a Sierpinski carpet 
array are shown in Fig. 1. 


3.2 Planar Triangular Arrays 

The first category of triangular arrays that will be considered are those that can be constructed 
from the uniformly excited three-element circular subarray generator. This three-element circular 
array of radius r = A/(2-\/3) can also be interpreted as an equilateral triangular array with half¬ 
wavelength spacing on a side (i.e., d=2J2). The normalized array factor associated with this 
triangular generating subarray is 


A 1 P 3 jgP~ ‘ 

AF p (0,<t>) = ±rYYL e 

p =1 n=l 


- sin ^ cos(^-^ )+a„ | 


where 


<*„=(«-!) y 


(15) 


a„ = --|sin* 0 cos(* 0 -<p J 

If we choose S= 2, then (14) becomes 

3 p~\ n~l 

where 


(16) 

(17) 
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V„ = ^[ sin 9 C0S W - <t >„) - sin 0 O cos(?> 0 - (j > n )] 

This represents the array factor for a triangular Sierpinski carpet array. 

The second category of triangular arrays that will be explored are produced by a six-element 
generating subarray. This generating subarray consists of two three-element concentric circular 
arrays with radii r, =XI(2yl3) and r 2 = A/(VJ). If we treat the generating subarray as a pair of 
three-element concentric circular ring arrays, then it follows from (10) that the normalized array 
factor in this case may be expressed as 

AF p (6,t) = ( 18 ) 

7 p=\ m=l «=1 

where 

I m =(2lm) (19) 

( 20 ) 

<j> m „ = (2w+m-3)y (21) 

a m „ = -kr m sin 0 O cos((Z> 0 -<j> m „) (22) 

We will next consider a special case of (18), namely when 5 = 2. In this case (18) reduces to 

AFAW = (23) 

7 p=\ m=\ »=! 


W, (0, «*) = [sin 6 cos(^ -</>„„)- sin 0 O cos(^ 0 - </> mn )] (24) 

Fig.2 illustrates the geometry for the first three stages of this triangular array and the 
corresponding current distribution. This type of construction scheme can be exploited in order to 
realize low-sidelobe array designs. A sidelobe level of at least -20dB can be achieved by higher- 
order versions of these arrays. 

3.3 Hexagonal Arrays 

The standard hexagonal arrays are formed by placing elements in an equilateral triangular 
grid with spacings d (see for example Fig. 2.8 of [6]). The normalized form of the hexagonal 
array factor is given by [6] 


j[k rpm sin 6 cos(^-^„ )+a pfrj! ] 


AF r (8,fl)=- 


p =1 m -1 n=0 

P ~p 5 
p~l m =1 «=0 


r pm =d^p 2 +(m-\) 2 -p(m- 1 ) 

,^r 2 +d 2 p 2 -d 2 (m-\) 2 

= cos —- 

L 2 r pm dp 

a pm = -fopm Sin6> 0 cos(0 o - 
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and P is the number of concentric hexagons in the array. 

We now consider a uniformly excited six-element circular array of radius r = AJ2 as the 
generating subarray. Consequently, the normalized array factor associated with this generating 


subarray may be represented as: 


AF P {9,<p) = 

6 p -j n =1 

(29) 

where 


(*„=(»- Df 

(30) 

a„ = -x sin 0 O cos(0 o -<?>„) (31) 

Substituting the value of 8 =2 into (29) yields an expression for the recursive hexagonal array 
factor given by 

0 n ~\ 

(32) 

where 

V„ (0J) = 4sin 6 cos(^sin 0 O cos((z) 0 - <p „)] 

(33) 


Fig.3 shows the element locations and current distributions for the first four stages of this 
hexagonal array. One potential advantage of these arrays is that they are thinned and, therefore, 
may be realized with fewer elements. Another advantage of these arrays is that they possess low 
sidelobe levels, as indicated in Fig. 4. Finally, the compact product form given in (32) offers a 
significant advantage in terms of computational efficiency, especially for large arrays. For 
example, this property may be exploited to develop rapid beamforming algorithms. 

When an element with two units of current is added to the center of the hexagonal 
generating subarray shown in Fig. 3, the array factor given in (29) must be modified in the 
following way: 

AF,iw = ±n{ 2+ i eiS '~ M \ (34) 

This allows a further reduction in sidelobe levels to be achieved and leads to a family of fully- 
populated hexagonal arrays. 
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Fig.l. Scale 1 is the generator subarray. Column 2 is the geometrical 
configuration of the Sierpinski carpet array: white blocks re¬ 
present elements that are turned on and black blocks represent 
elements that are turned off. Column 3 is the corresponding 
array factor where the angle phi is measured around the cir¬ 
cumference of the plot and the angle theta is radially measured 
from the origin at the lower left. 
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Fig.2. The geometry and current distribution for the first three 
triangular arrays. 
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Fig.3. The element locations and associated current distributions for 
each of the four hexagonal arrays . 
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Three Dimensional Scattering Analysis in Stratified Medium 
using Fast Inhomogeneous Plane Wave Algorithm + 


Bin Hu and Weng Cho Chew 
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(Dedicated to the memory of James R. Wait for his expansive contributions 
to electromagnetics and his boundless energy) 

1. Introduction 

The scattering of electromagnetic waves in the presence of layered medium 
is a subject of great practical interests. Major applications include, but are 
not limited to, microstrip antenna and MIC. Among many techniques which 
have been proposed for this purpose, the Method of Moments (MOM) so¬ 
lution of the integral equation encountered in the layered medium studies, 
often named as full wave analysis method, has received considerable atten¬ 
tion. However, two obstacles limit the analysis of practical problems. 

First, for the layered medium problem, the spatial domain Green's func¬ 
tion can be cast into a Sommerfeld-type integral (SI) from which no general 
analytic closed form is available. Numerical integration along the Sommerfeld 
Integration Path (SIP), though possible, converges too slowly to be practical. 
Second, the basic idea of MOM is to transform an integro-differential equation 
to a matrix equation, which is a dense one in the general case. Solving this 
matrix equation by using traditional scheme, such as Gaussian elimination, 
which requires 0(iV 3 ) CPU time, or iterative solver with direct matrix vec¬ 
tor multiplication, whose computational complexity scales as 0(iV 2 ), quickly 
exhausts the currently available computing resources. 

To address this problem, Fast Multipole Method (FMM) coupled with 
Discrete Complex Image Method (DCIM) has been proposed to solve strati¬ 
fied medium problems efficiently [1]. However, as we pointed out in [2], this 
approach is far from being optimal, partially because DCIM is basically an 
asymptotic method and not rigorous. 

In this report, we propose a new approach, named Fast Inhomogeneous 

fThis work was supported by AFOSR under MURI Grant F49620-96-1-0025. 
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Plane Wave Algorithm (FIPWA). The major idea of this approach is to ex¬ 
pand the Green’s function encountered in layered medium study as the sum¬ 
mation of the inhomogeneous plane wave, which is sampled along a carefully 
chosen Steepest Descent Path (SDP). By using the interpolation and extrap¬ 
olation technique, the translation matrix is diagonalized. The advantages 
of this method, compared to the FMM/DCIM approach, include efficiency, 
accuracy, and versatility. 

Throughout this paper, e~ lut time convention is assumed and suppressed. 

2. Fast Inhomogeneous Plane Wave Algorithm in Layered Medium 

2.1. Basic Formulation 

By setting up the coordinate system as shown in Figure 1 and assuming 
that the scatterer is located entirely above the layered medium, the dyadic 
Green’s function, which is expressed in the Sommerfeld-type integral, can be 
derived. For illustration purpose, the G xx component is formulated as 

G-(r*r,) = gjT , dk^Ht\kA Pj - Pi \) (e*” 1 *"*' + R TE 

( 1 ) 

where d is the distance from the source point, i.e., t*, to the interface of 
the layered medium and R TE (k p ) is the generalized reflection coefficient [3]. 
Separating Equation (1) into the direct and reflected field term, denoting 
Tii — XiX + y^y + (zi — 2d)z as the mirror image point of the source point r t -, 
and using k p = k sin 6 and k iz = k cos 9 , we have 

Gtxir^Vi) = J d9sin$H?\k8mO\ Pi - (2) 

G^(r,;ri) [ d6 sin 6R TE {k sin d)H^ ] (k sin 0\ Pj - Pi , 

07 T J T I 

( 3 ) 

where T and F 7 are the properly chosen paths on the complex 9 plane. Ap¬ 
plication of FIPWA to the first term, which is the Green’s function for free 
space, has been studied in [4]. Therefore, we will concentrate our discussion 
on the second term, which is used to account for the reflected wave contribu¬ 
tion from the layered medium. 

2.2. Steepest Descent Path 

To determine the Steepest Descent Path for 9 so that the integral con¬ 
verges quickly, we first use the asymptotic expansion of the Hankel function, 

i-e., 
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Therefore, asymptotically, the integrand in Equation (3) takes the form 
e ikpsme+ikzcos9 The determination of the SDP for this kind of integrand 
has been extensively studied in the 2D FIPWA case [5]. In summary, to 
avoid the exponential large peaks, a modified SDP is defined and is shown in 
Figure 2. However, it is important to note the difference between this path 
with the one in the free space. After changing the integration path from the 
SIP to SDP, the possible pole and branch point contribution, as illustrated 
in Figure 2, cannot be neglected. 

Therefore, denotes f(9) = k sin 9 R TE (k sin 9), {9 qii w qi ) as the quadrature 
coefficients on the SDP path, 9 P as the contributing poles, and (9 b , Wb) as the 
quadrature parameters on the branch cut, Equation (4) can be rewritten as 

GLto,*) =G'f p (r J ,r i ) + G*( r*r,) + Gg(r„ r<) 

»gf E w q J{e n )H^\k Pii sin ^ 

+ lrE Res W sin °p h o 1} (**# sin d p) e<kZii ‘ 005 6p 

v 

+ ^E u '» (/ + W - rm) H^(kp Jt sme b )e ik V C0 ^ (5) 

b 

where Res{# p } denotes the residue of R TE (ksin9) at 9 = 9 P and f + (9) cor¬ 
responds the upper Riemman sheet of kp? z of the outmost layer. For layered 
medium backed with PEC ground plane, the contribution from the branch 
point disappears, as we have pointed out in [2]. G r xx DP , G r xx , and G xx 
represent the integral along SDP, the pole and branch point contribution, 
respectively. Up to this point, it is worth to point out that only f(9) changes 
for different dyadic components. 

2.3. Diagonalization 

Before proceed to diagonalize the translation matrix, we group the source, 
i.e., Tii , and observation point, i.e., Tj, into groups, which are centered at 
Tin and rj, as shown in Figure 1. Then, by applying the 2D Fast Multipole 
Method, the Hankel function in Equation (5) can be expressed as the summa¬ 
tion of the two dimensional inhomogeneous plane waves. After that, G r x s x DP 
in Equation (5) can be expressed as 

G r x S x DP (T j , Ti) 

= g^ 

f(0<nK? FMM (k sin , <t> n )e ikZu " cos9 « 

=EE fijl{@qi 5 ) ‘ Til’ 1 (^ 9 ii ^92) * PvtitiQqii $92) (®) 

9i 92 
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where 


Twt{O qi ,K) = g^ I ^ 2 /(^ 1 )T i ^™ M (fc s m0, 1) 0 ?2 )e to «'' cos ^ 1 (7) 

In the above equations, stands for the translation coefficients for 

2DFMM and $//*/ and fiji represent the far field pattern function for the 
source and observation groups, respectively. 

Denoting J(D 9 ,fi s ) as the interpolation and extrapolation kernel, where 
Q q = (0 9l , 4> q2 ) is the quadrature samples in Equation (6) and Q s the samples 
on real 6 and <j> space, we have 

g£ dp ( xj,T t ) ajMnj-frtin.) 

q s 

= Y, &( n «) ■ T n „ (fi s ) • /w (n s ) (8) 

s 

where __ 

T u „(n s ) = '£i(n 9 ,n a )T u „(Q q ) ( 9 ) 

q 

It is easy to see that Equation (8) yields a diagonal translator, T^/(Q 5 ). 
For the pole and branch point contribution, similar manipulation can be 
performed. 

To this point, it is worth to note some merits of this approach in layered 
medium. 

Observation 1. The information of the layered medium is embedded in 
the generalized reflection coefficient, i.e., R TE in Equation (1). Therefore, 
the algorithm developed above is independent of the structure of the layered 
medium, such as the number of layers, the thickness and the electromagnetic 
property of each layer. 

Observation 2. It is not difficult to see that the contribution from the 
reflected wave can be accounted for by one term, which is equivalent to placing 
a source at the mirror image of the source point, although strictly speaking, 
no image source really exists. This property is desirable when analyzing non- 
planar structures, which currently is not feasible in FMM/DCIM approach, 
due to the non-linear relation between the source point and its complex image 
terms. 

Observation 3. The computational complexity for including the reflected 
wave contribution is the same as that for free space, which is O(NlogN) if 
multilevel scheme is used. 

3. Numerical Results 

For illustration purpose and without loss of the generality, we use a simple 
microstrip structure to validate our algorithm. 

In Figure 3. the structure under study, which is a PEC sphere with radius 
R = 0.5A on top of a single layer medium backed with PEC ground, is shown. 
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The working frequency is chosen at / = 0.15 GHz. The permittivity of the 
substrate is £ r = 2.56. The distance between the center of the sphere and the 
air-dielectric interface is 0.75A. 

Figure 4 and 5 plot the scattered field when the thickness of the sub¬ 
strate is chosen as t = 0.25A and the plane wave is incident from ( 9 mc , (f> inc ) = 
(0°, 0°). The results are compared with those computed using the traditional 
CG solver with the matrix elements generated by direct Sommerfeld inte¬ 
gration and direct matrix vector multiplication. The agreement is excellent. 
In Figure 6 and 7, we compare the current distribution on the sphere with 
or without the presence of the layered medium. The effect of the layered 
medium is obvious. 


4. Conclusion 

In this report, we propose the Fast Inhomogeneous Plane Wave Algorithm 
for the efficient analysis of stratified medium problems. Compared to the 
previously developed methods for this purpose, the new algorithm has several 
advantages, such as being simple, accurate, versatile for arbitrary scatterer 
shape and layered medium structure. Applying this approach to more general 
problems encountered in the layered medium, such as the electromagnetic 
scattering from the buried objects within lossy medium, is feasible in principle 
and is currently under investigation. 
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Figure 1. Structure under investigation. 


Figure 2. The steepest descent path. 


Figure 3. A sphere on top of one layer microstrip structure 




Figure 4. Scattered field in xz plane. 


Figure 5. Scattered field in yz plane. 



Figure 6. Current Distribution in Free 
Space 


Figure 7. Current distribution with lay¬ 
ered medium 
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Abstract 

An efficient algorithm combining multilevel fast multipole method and the discrete complex 
image method is presented for analyzing large-scale microstrip structures. The resulting algorithm 
has the memory requirement and the CPU time per iteration proportional to 0(N log AT), where 
N denotes the number of unknowns. Numerical results for microstrip antennas are presented to 
demonstrate the efficiency and accuracy of this method. 

1 Introduction 

The method of moments (MoM) has been widely used for the analysis of microstrip structures, 
such as microstrip antennas, microwave integrated circuits, and microstrip interconnects. To sim¬ 
ulate large-scale complex microstrip structures, it is often necessary to employ a large number of 
unknowns. For the conventional MoM, the memory requirement is proportional to 0(N 2 ). The 
computing time can also become very excessive because direct matrix inversion solvers require 
0(N Z ) floating-point operations. When an iterative solver is employed for solving the MoM matrix 
equation, the operation count is 0(N 2 ) per iteration because of the need to evaluate the matrix- 
vector multiplication. This operation count is too high for an efficient simulation. 

To make the iterative method more efficient, it is necessary to speed up the matrix-vector 
multiplication. There are several techniques developed for this purpose, including the adaptive 
integral method (AIM) [1], the fast multipole method (FMM) [2]-[4], and the singular value de¬ 
composition based algorithm [5]. Recently, effort has been made to extend these fast algorithms 
to microstrip problems. The AIM is successfully adapted in [6] with the aid of the discrete com¬ 
plex image method (DCIM) [7]. Extension of FMM is difficult because of its dependence on the 
Green’s function. One approach is to combine FMM with DCIM [8]—[10]. In [8] and [9], which 
treat the static and 2-D problems, the equivalent problem is set up by adding N c images at the 
corresponding complex coordinates, and therefore, represented by AT(iV c -4-1) basis functions. In the 
FMM implementation, the translation is different for different images. In [10], both the 2-D and 
3-D FMMs are employed because the surface-wave poles are extracted in DCIM, which makes the 
implementation complicated. The multilevel algorithm is not implemented in those analyses. The 
other FMM approach is to express the Green’s function in terms of a rapidly converging steepest 
descent integral and then to evaluate the Hankel function arising in the integrand by FMM [11]. 
This approach is good for thin-stratified media. 

In this paper, the multilevel fast multipole algorithm (MLFMA) [2], [4] combined with DCIM 
is presented for efficient analysis of microstrip structures. Instead of being treated separately, the 
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image sources are grouped with the original source. By the use of the multilevel algorithm, the 
complexity is reduced to O(NlogN). The algorithm requires little extra computation compared 
with that applied to free-space problems. Numerical results for microstrip antennas are presented 
to demonstrate the efficiency and accuracy of this method. 

2 Formulations 

Consider a general microstrip structure residing on an infinite substrate having relative permittivity 
e r and thickness h. The microstrips are in the x-y plane and excited by an applied field E Q . The 
induced current on the microstrips can be found by solving the well-known mixed potential integral 
equation (MPIE) [12]. First, the microstrips are divided into triangular elements and then the 
current is expended using triangular rooftop basis functions. Applying Galerkin’s method results 
in a matrix equation 

Z • I = V (1) 

in which the impedance matrix Z has the elements given by 

[fi(r) fi(r')G a (r.r') 

-^V • fj(r)V' • f,(r')G,(r, r')] dr'dr (2) 

where £ and fj represent the testing and basis function, respectively, Ti and Tj denote their supports, 
G a is the sx-component of the Green’s function for vector potential, and G q is the Green’s function 
for scalar potential. In general, both G a and G q can be expressed as an inverse Hankel transform 
of their spectral domain counterparts, which is commonly known as the Sommerfeld integral (SI). 
The analytical solution of the SI is generally not available, and the numerical integration is time 
cons umin g. This problem can be alleviated using DCIM [7], which yields closed-form expressions 
as 

Nc p-jkr p 

G(r,r') = £ r j» = l r “ ( r ' + Sb p )I ( 3 ) 

p=o 47rr P 

where a p and b p are the complex coefficients obtained from DCIM. 

To use FMM, we first divide the entire structure into groups denoted by G m (m = 1,2, • • •, M). 
Letting r i be the field point in group G m centered at r m , and tj be the source point in group G m * 
centered at r m /, we have 


r ij = ri- (rj + zb p ) 

- (r* - r m) + (rm — r m /) + (r m . - Tj) - zb p 

= r im d" W ^jm' zb p . (4) 

Employing the addition theorem and the elementary identity [3], we can rewrite the Green’s function 
in (3) as 


G(ti, Tj) 


k 

jl6n 2 



x • k)d 2 k 


( 5 ) 
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where 

£, 



T(r mm ' • k) = 4- l)h/ 2) {kr mm i)Pi(r m7n > • k). 

1=0 

(6) 

Substituting (5) into (2), we obtain 



Zij = ~ [f S a (k)V im (k) ■T{f mm ,,k)V jm ,(k)d 2 k 



~j S q (k)Vi m (k)T(f mm i , k)v; m , (fe)j <P'k 

(7) 

where 

Ui m (&) = f e“ jkr ^fjdr 

JTi 

(8) 


V im (k) = j ( e -3k r ‘ m V • tidi 

(9) 


s{k) = Yl a P e,kiip - 

(10) 


p - 0 


When an iterative method is used to solve (1), the matrix-vector multiplication can be per¬ 
formed in such a way that the contributions from nearby groups are calculated directly and the far 
interactions are calculated using (7). 

The concept of FMM can be extended to multilevel using MLFMA [4]. To implement MLFMA, 
the entire solution region is first enclosed in a large box, which is divided into four smaller boxes. 
Each subbox is then recursively subdivided into smaller boxes until the edge length of the finest box 
is less than half a wavelength. For two elements in the same or nearby finest boxs, their interaction 
is calculated in a direct manner. However, when the two elements reside in different nonnearby 
boxes, their interaction is calculated by FMM, as described above. The level of boxes on which 
FMM is applied depends on the distance between the two elements. 

3 Numerical Results 

Before the proposed method is applied to realistic problems, the accuracy of this algorithm is 
examined. Consider a microstrip line on a substrate with relative permittivity e r = 2.17 and 
thickness h = 1.58 mm. The frequency is 3.0 GHz. The line is 5 mm wide and 400 mm long. As 
shown in Fig. 1, the line is discretized into triangular elements with edge length 5 mm. Fig. 1 shows 
the values of matrix elements Z\j(j — 1,2, - • ,80) obtained by using two different approaches. One 
approach is to use the original formulation (2). The other approach is to use MLFMA, where the 
group size d is 0.25 Aq with Xq being the wavelength in free space. Note that Z\j(j = 1,2, • • •, 14) is 
considered as the near interaction and is calculated directly, and Zij{j = 15,16, ■ • •, 80) is considered 
as the far interaction and is calculated by MLFMA. As seen from Fig. 1, these two approaches agree 
well. In this calculation, the number of modes L is chosen to be kd + 31n(7r + kd). 

Next, the complexity of this algorithm is evaluated. The CPU time per iteration and the 
memory requirement versus the number of unknowns are plotted in Fig. 2 for solving the problem 
of a rectangular microstrip patch antenna. It is seen that the CPU time per iteration is scaled as 
O(NlogN) and the memory requirement is scaled as 0{N). 

Now consider a corporate-fed microstrip antenna array, which involves 6569 facets and 8668 
unknowns. At frequency / = 9.42 GHz, the current distribution and the radiation patterns in the 
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Figure 1: Matrix element Z\j(j = 1,2, • - ■ ,80) calculated by the conventional MoM and MLFMA. 



Number of Unknowns 


Figure 2: Complexity of MLFMA. The CPU time per iteration is close to 9 x 10 6 A'log N and the 
memory requirement is close to 7 x 10 ~ 3 N. 

two principal planes are given in Fig. 3, which shows excellent agreement between this method and 
the conventional MoM. For the conventional MoM, the memory requirement is over 600 MB and 
the CPU time per iteration is 15.8 seconds. However, it takes only 36.3 MB and 3.0 seconds for 
the 5-level MLFMA. The MLFMA also yields an over 70 percent reduction in the CPU time for 
the matrix fill comparing to the conventional MoM. 
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Figure 3: Current distribution and radiation patterns of the corporate-fed microstrip antenna array. 
e r = 2.2, h — 1.59 mm, l = 10.08 mm, w ~ 11.79 mm, d\ — 1.3 mm, = 3.93 mm, l\ = 12.32 
mm, I 2 — 18.48 mm, D\ ~ 23.58 mm, £>2 = 22.40 mm. / = 9.42 GHz. 


4 Conclusion 

This paper presents a fast algorithm to deal with large-scale microstrip problems. The MLFMA 
originally developed for free-space problems is extended to microstrip problems with the aid of 
DCIM. The complexity of this algorithm is scaled as 0{N log N). The efficiency and accuracy of 
this method is demonstrated by numerical results. 
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Abstract 

A new approach is proposed to enhance the efficiency and reduce the memory 
requirements of the multilevel fast multipole algorithm (MLFMA) when applied to the 
higher-order Galerkin’s method. This approach represents higher-order basis functions 
by a set of point sources such that a matrix-vector multiply is equivalent to calculating 
the fields at a number of points from given current sources at these points. The 
MLFMA is then applied to calculate the point-to-point interactions. This permits the 
use of more levels in MLFMA than applying MLFMA to basis-to-basis interactions 
directly and thus reduces the memory requirements significantly. 

1 Introduction 

The electromagnetic (EM) interactions with arbitrarily shaped objects can be characterized 
by seeking the solution of an integral equation whose unknown function is the induced 
current. In this process, the integral equation is first discretized into a matrix equation 
using the Galerkin-based method of moments (MoM) with subdomain basis functions such 
as roof-top functions for curvilinear quad patches and Rao-Wilton-Glisson (RWG) functions 
[1] for triangular patches. It is convenient to model objects with arbitrary shape using 
triangular patches and hence RWG functions are widely used for representing the unknown 
current distribution. 

When iterative solvers are used to solve the MoM matrix equation, the calculations of 
matrix-vector multiplys can be accelerated using fast multipole method (FMM) or multilevel 
fast multipole algorithm (MLFMA) [2-4]. The FMM reduces the complexity of a matrix- 
vector multiply from 0(N 2 ) to 0(N 15 ), where N is the number of unknowns. The MLFMA 
further reduces the complexity to 0(N\ogN). The memory required by MLFMA is only 
O(NlogN) and hence one can solve very large problems on existing computers. Integral- 
equation solvers that use RWG basis functions and employ the standard implementations of 
MLFMA such as FISC [5,6] and ScaleME [7-9] have been very successful in solving large- 
scale problems. However, these basis functions have a poor convergence and need a large 
number of unknowns for a desired accuracy. For example, a sampling rate of 10 unknowns per 
wavelength (about 200 unknowns per square wavelength for quad patches or 340 unknowns 
per square wavelength for triangular patches) typically results in an accuracy of O.ldB in 
the radar cross section (RCS) calculation. The average edge size of the patches is about 
0.1 A. As a result, it is very expensive to achieve a high accuracy by using RWG bases or any 
other low-order functions. Hence, it is imperative to use higher-order basis functions [10,11] 
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with vastly superior convergence properties to obtain accurate results using only a fraction 
of the unknowns required by low-order basis functions. It has been demonstrated [11] that 
using higher-order basis functions, the sampling rate for an accuracy of O.ldB in RCS can 
be reduced to 40 ~ 70 unknowns per square wavelength with an average edge length of 
0.5 ~1A. 

However, when MLFMA is used to accelerate the solution of the higher-order Galerkin- 
based MoM [12], its performance is limited for the reason explained below. In MLFMA, only 
the near interactions of the MoM matrix are computed explicitly whereas the far interactions 
are computed implicitly by using the multipole expansion of current distributions. In this 
procedure, the scatterer is first enclosed in a large cube, which is then divided into eight 
equally sized small cubes. Each of the small cubes is further divided into eight smaller cubes 
recursively until the smallest cube size is about several times bigger than the longest patch 
edge. Thus, the edge length limits the number of levels used in MLFMA. 

In Galerkin-based MoM using RWG basis functions, since the edge length is about 0.1 A, 
the finest cube is about a quarter of a wavelength. However, because of the large patch size 
used with higher-order basis functions, the number of levels in MLFMA is at least one or 
two less than that for RWG basis functions. In MLFMA, the radiation pattern of each basis 
is calculated and stored. The number of samplings ( K ) in the radiation pattern is given by 

K = 2 L 2 , L = kd + a(kd) 1 ' 3 , (1) 

where k is the wavenumber, a depends on the accuracy, and d is the diameter of the group 
size, which is bigger than the cube as shown in Figure 1(a). Each higher-order basis needs 
much more samplings than the RWG basis. The average number of unknowns required is 
about 1.5 per patch for RWG basis, 5 and 10.5 for the first- and second-order basis functions, 
respectively. Thus, the number of unknowns in each group for higher-order basis functions 
is more than that for RWG basis. Therefore, more near interactions have to be calculated 
and stored for each higher-order basis. Consequently, MLFMA cannot work efficiently with 
higher-order Galerkin’s method. 

To overcome this problem, we propose to implement MLFMA based on point-to-point 
interactions, instead of the traditional basis-to-basis interactions. When we calculate the 
matrix elements for which the testing and source bases are not close to each other, we can 
apply Gaussian quadrature to evaluate the integrals. This process can be interpreted as 
replacing a continuous source distribution with discrete sources as shown in Figure 1(b). 
Thus, one matrix-vector multiply is similar to the calculation of the electromagnetic fields 
for a given distribution of N source bases and then testing them with these bases. In this 
implementation, we first find Q equivalent point sources from these N source bases, then 
calculate electromagnetic fields at these Q points, and finally test them with each testing 
basis. The value of Q depends on the number of patches and the quadrature rule used for 
each patch. The MLFMA is used to calculate electromagnetic fields at Q points generated 
by Q point sources. By doing so, the number of levels used is not limited by the size of basis 
functions, making MLFMA more efficiently. Furthermore, the near interaction part of the 
MoM matrix is redefined as the difference between the original matrix and the interactions 
calculated by MLFMA. Consequently, the memory requirement can be reduced as well. In 
the following section, this approach is first described in detail. Numerical results validating 
the point-based method are then presented, which are followed by the conclusion. 
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Figure 1: Applying MLFMA to (a) basis-based sources and (b) point-based sources. 

2 Applying MLFMA to Point-Based Sources 

By testing the electric field integral equation (EFIE) with the basis function tj , the matrix 
element Zji is written as 

Zji = J dStjix) • J dS’ G(r, r') • t,(r'), (2) 

where the dyadic Green’s function is given by 

Ctr r n _ jfo7 f T W'1 e ‘ fc|r ~ r '' 

J 47r k 2 \r — r'| 

and r/ denotes the impedance. When the testing basis (tj) is sufficiently far away from the 
source basis (£*), the integrals can be evaluated by using Gaussian quadrature to give 

zf = E Wjptj(tp) • E G(t p , r ? ). WiM**) 

P~ 1 9=1 

= E J jM • E G( r ?> r ?) • 

p=l 9=1 

where Q is total number of quadrature points on all patches, Wj p is the quadrature weight 
at the pth point, which is non-zero only on the patch supporting the basis tj , and Jj(r p ) is 
defined as Wj P tj(r p ). Correspondingly, the matrix-vector multiply for the far interactions is 
written as 

EZjfzi = E J A r P> • E ®( r J» r ?) • E (3) 

i=l p =1 q -1 i— 1 

where Xi is the coefficient of the zth basis function. By defining 

# P = E G(r P , r 9 ) • J„ J, = E (4) 

9=1 i=l 
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Equation (3) is simplified as 


E^ = I^(r i)-*r 

i—1 p-1 

The above formulation means that for a given distribution of N source bases, calculate the 

electric fields and test them with these bases. The MLFMA can be used to calculate the 

fields generated by current sources very efficiently. Defining a variable I pq such that 

r _ / 1 P~Q interaction is calculated by MLFMA, 
vq ~ | 0 otherwise, 

the matrix-vector multiply calculated by MLFMA is modified as 

E 2 t x i = E J ifa) • E 4«Gfa> r ? ) • J2 (5) 

i=1 p=l q -1 *=sl 

Now, the near interaction part of the MoM matrix is redefined as the difference between the 
original matrix and the interactions calculated by MLFMA, so we have 

Zjr = Zji - Z%, (6) 


Zfr = E ■ E 4 ? G(r p , r,) . J;(r, 


7near _ 7 vfar 

= f dStj(r) • / dS'C^r')*^') - £ J^r,) • £ 4 9 G(r p , r 9 ) • J,(r fl ). 

J J v q 

If the testing basis tj and the source basis belong to the nearby groups but are not very 
close to each other, the same quadrature rule used for evaluating the far interactions is used 
to calculate the first term of the above equation. If all point-point interactions are calculated 
by MLFMA, Z^ ear equals to zero. Therefore, Z Dear is a sparse matrix and has less non-zero 
elements than the matrix for near interactions between bases. 

Since the current vector in Equation (4) has two tangential components only, we write 
it in terms of two independent tangential vectors as 

*»fa) = E ^faWfa)- (?) 

l'=l 

Therefore, current J q at the qth. source point r q is written as 

Jq = E W iq ti(T q )Xi = E w iq E A 1' fata;' fata 

*=1 i=1 I'-l 


= E A <-fa)*?fa 

('=1 


854 





where 


>?(*») = E ^#8-(**)*■ 

1=1 

Substituting the above expression into Equation (3) yields 

E z fi x i = E E WW A i(r„) • E G(r p , Ig) • E A i' (*«)*?( r ?) 

*=1 p=l 9=1 i'=l 

Applying the addition theorem to the dyadic Green’s function [2-3] yields 


(9) 


—. . ikrj f= Wl e 2 ^ rp ~ r<? l 

G( r P ,r g ) = — l r _ r i 


47T l 

ikrj 


J d 2 k (i — kVj e jk ‘( rpm “ r « m/ )a(k,i 


( 10 ) 


where 


a ( k , r mm') = E( _ j)'( 2i + l)^ <2) (^mm')fi(W * k), 

1=0 

Tpq — = l*p r m H" l*m r m' = ^pm 4" 


pm i A mm' A <jm' 5 


and r m and r m / are the group centers for the points p and g, respectively. Consequently, the 
far interactions of the matrix-vector multiply can be written as 


E Zjfxi = E / ^v^k) £ ( k > w) E E v; v (kW W, 

i=l 1=1 

where C/j? = and the radiation and receiving patterns are given by 

V mip (k) = e ik ' r -(I-fcfe)-A i (r p ). 


( 11 ) 


( 12 ) 


Since the point sources do not have any spatial extent, we can refine the finest cube 
in MLFMA as small as a quarter of a wavelength. There are two advantages in applying 
MLFMA to point-to-point interactions. First of all, the group size is the same as the cube 
and there is no overlapping in sources as shown in Figure 1. Secondly, smaller group size 
means that much less samplings over the radiation patterns are needed. As a consequence, 
the memory requirements for the radiation pattern can be reduced. Further reductions can 
be obtained if the radiation patterns are calculated on the fly since no numerical integration 
is needed to calculate the radiation pattern. 


3 Numerical Results 

The inherent advantage in applying MLFMA to the point-based sources is shown by comput¬ 
ing the RCS of a perfectly electric conducting (PEC) sphere with different diameters using 
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Figure 2: Memory requirements of MLFMA as applied to basis-based and point-based 
sources. A PEC sphere and the second-order basis functions are used. 

the second-order basis functions [11] and then comparing the memory needed for applying 
MLFMA to basis-based sources as shown in Figure 2. The discretization is kept constant at 
33 unknowns per square wavelength and the diameter of the sphere is increased from 5.4A 
to 18A. The corresponding bistatic RCS patterns are plotted in Figures 3 and 4. In Figure 3 
the Mie series solution is compared with the basis-based approach and in Figure 4 the point- 
based and basis-based methods are compared. Compared with the Mie series solution, the 
RMS error is 0.138dB in the basis-based approach and 0.146dB in the point-based approach. 
The MLFMA kernel used for all examples in this paper is ScaleME [7-9]. 

4 Conclusion 

A new approach is presented to make the higher-order Galerkin’s method work more effi¬ 
ciently with the MLFMA. The far interactions in the matrix-vector multiply are calculated 
by applying MLFMA to the point interactions. This approach allows one to use two more 
levels in MLFMA than applying MLFMA to basis interactions directly, and hence reduces 
the memory requirements significantly. Further reductions can be obtained if the radiation 
patterns are calculated on the fly. 

Acknowledgment: The authors would like to thank Dr. S. Velamparambil for the special¬ 
ization of ScaleME and his personal communications. 
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Figure 3: Bistatic RCS for a sphere of 18A in diameter calculated by the Mie series and 
MLFMA applied to basis-based sources. The 4-level MLFMA and 820MB are used. 



Figure 4: Bistatic RCS for a sphere of 18A in diameter calculated by MLFMA as applied to 
basis-based and point-based sources, respectively. In the point-based approach, the 6-level 
MLFMA and 390MB are used. 
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Abstract 

Incomplete LU (ILU) preconditioner using the near-field matrix of 
the fast multipole method (FMM) is investigated to increase the effi¬ 
ciency of the iterative conjugate gradient squared (CGS) solver. Unlike 
the conventional LU, ILU requires no fill-ins hence no extra memory 
and CPU time in computing the LU decomposed preconditioner. It is 
shown that due to the nature of the near-field matrix, ILU precondi¬ 
tioning decreases the number of iterations dramatically. 


Introduction 

The FMM has been proposed and successfully applied to increase the effi¬ 
ciency of the method of moments (MoM) solution of large scale electromag¬ 
netic scattering and radiation problems [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]. The 
FMM is based on the iterative solution of the MoM matrix equation using 
an indirect fast computation of the matrix vector product. The convergence 
behavior is strongly dependent on the condition of the constructed matrix 
as well as the iterative solver being used. Various iterative solvers are avail¬ 
able including the conjugate gradient (CG), biconjugate gradient (BiCG), 
conjugate gradient squared (CGS), quasi-minimal residual (QMR), and gen¬ 
eralized minimal residual (GMRES). The convergence of all these iterative 
solvers is dictated by the matrix condition which typically deteriorates as 
the matrix size increases. Therefore, iterative solutions of such large, fully 
populated matrix systems inevitably requires some kind of preconditioning. 
Otherwise, propagation of numerical errors during the execution of the iter¬ 
ative solution may lead to convergence failure. 

Various preconditioning techniques for improving the condition of the 
system can be found in [12]. Diagonal and block diagonal preconditioners 
for multilevel FMM implementations were reported in [6]. Here we present 
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an ILU preconditioner for MoM and FMM implementations. By its nature, 
the ILU preconditioner is constructed using the near-held FMM matrix and 
is shown to significantly reduce the number of iterations required for con¬ 
vergence. 


Method of Moments and the Fast Multipole Method 


The MoM formulation of electromagnetic scattering problems using 2 nd order 
curvilinear quadrilateral elements for surface modeling is given in [13]. The 
FMM implementation used herewith was built upon that in [13, 14]. 

Briefly, for the electric field integral equation (EFIE) the resulting linear 
system after Galerkin’s testing is of the form 


where 


and 


^^ Zmn a n — ?Tl — 1, 2,. . . , iV 

71=1 

Z mn = Jdsf m (r) ■ J ds' k(r') + pV' - f„(r')V 

Vm = Tv 


R 


( 1 ) 

( 2 ) 

(3) 


Througout, an e~ ,brt time convention has been assumed and suppressed. 
Here, { a n } refers to the column containing the unknown coefficients of the 
surface current expansion 


N 

J 0) 58 D fl njn(r). 

71=1 


(4) 


In the above expressions, r and r' denote the usual observation and 
source point locations, E*(r) is the incident excitation plane wave at r, t 
is the vector tangent to the surface at r, 77 = 1207 T denotes the free space 
impedance, and k = 2i?/X is the free space wavenumber. 

The key aspects of the FMM implementation are 

• Iterative solution of the MoM system of equations [ Z]{a } = {V} 
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• Fast evaluation of the matrix vector product [ Z]{a }. 

Fast evaluation of the matrix vector product (using 0(N X * 5 ) or less resources, 
instead of 0(N 2 )) is attained by approximating the pertinent Green’s func¬ 
tion [1], It can be shown [1] that the CPU time per iteration/matrix vector 
product for FMM is 0(N 1 ' 5 ) instead of 0(N 2 ). 

The above CPU estimates are asymptotic in the sense that they represent 
values which are approached for very large N. The actual efficiency of the 
implementation depends on the choice of various parameters. These choices 
control the constant in front of the asymptotic behavior of the CPU require¬ 
ments. However, choices of the FMM parameters for faster implementation 
inevitably lead to errors in the matrix vector product inducing erroneous 
minima for the iterative solver. This aspect may lead to unacceptable error 
since it may cause the solver to converge at one of these minima, especially 
when non-monotonic solvers (e.g. CG, BiCG, or CGS) are employed. 

The specific integral equation formulation, be it the EFIE, MFIE, or 
CFIE, also affects the solution convergence. 


Preconditioning Necessity 

For large scale simulations possibly with geometrical surface details (e.g. 
antenna arrays on aircraft), the density of the surface mesh cannot be ex¬ 
pected to be uniform. Nevertheless, a nonuniform mesh is well known to 
produce ill-conditioned MoM matrix equations. Also, different formulations 
of the same electromagnetic problem results in different system conditions, 
e.g. the CFIE formulation results in much better conditioned systems than 
the EFIE or MFIE. Also, as noted above, the FMM implementation intro¬ 
duces erroneous minima in the solution domain. Use of a preconditioner is 
therefore essential for robust implementations of iterative solvers. 

Although the diagonal preconditioner is simple and leads to significant 
convergence improvements, it does so for diagonally dominant matrices. 
Block diagonal preconditioners are more robust but require renumbering 
of the grid or matrix rearranging so that the dominant matrix terms are 
clustered around the diagonal. This can easily be done for 2D problems, 
but is very difficult in 3D. Alternatively, when the FMM is used to speed 
up the iterative solution, we have the natural choice of using the near-ffeld 
portion of the MoM matrix for preconditioning. These near-field elements 
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axe the largest in magnitude and constitute the unapproximated portion of 
the system matrix. 

To develop the preconditioner, the near-held matrix can be directly de¬ 
composed via LU. However, depending on the sparsity pattern of the near- 
held matrix, this LU process may require a significant amount of hll-ins. For 
large scale simulations, these hll-ins (hence the increase in memory usage) 
becomes a bottleneck in memory utilization. Alternatively, the hll-in re¬ 
quirement of direct LU can be resolved by using the ILU factorization. The 
ILU algorithm is the same as a direct LU algorithm except that it avoids 
hll-ins of elements in the decomposed LU matrices. This also results in less 
CPU utilization. 


ILU Preconditioner for FMM 

The algorithm for the ILU can be found in [12]. The pseudo code is repeated 
below for completeness: 

for i = 2,...,n, do: 

for k = and for (i,k) in NZ(Z) do: 

compute Zik = z ik /z kk 

for j = k+l,...,n and for (i,j) in NZ(Z) do: 

compute Zij = — Z{ k z k j 

end do 
end do 
end do 

Here NZ(Z) is the sparsity pattern of the near-held matrix Z, and the 
conventional LU decomposition algorithm is only applied to the non-zero 
entries of the matrix, hence the memory used is not affected and the sparsity 
pattern of the stored ILU matrix is the same as the original matrix which 
further saves memory in the sparse storage of the ILU matrix. 


Performance of Preconditioned CGS Solver 

To evaluate the performance of the ILU preconditioner, we considered a per¬ 
fectly electrically conducting (PEC) ogive geometry (depicted in Fig. 1). For 
this study, the ILU preconditioner was implemented in the matrix systems 
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based on the EFIE, MFIE, and CFIE formulations. Again, the FMM near¬ 
field matrix is used as the preconditioner and the CGS iterative solver was 
used. The size of the matrix system was 480 and refers to a 10" X 2" x 2" 
ogive with its long axis coincident with the x-axis. All calculations were 
carried out at 5.91 GHz. 

Fig. 2 shows the residual error as a function of iteration number for the 
EFIE matrix. It is seen that due to the irregular mesh around the sharp 
tips of the ogive, the CGS solver does not converge in less than 50 iterations 
without preconditioning. However, when the ILU preconditioner is used, 
convergence is dramatically improved, needing only and A/50 iterations to 
achieve an error of 10“ s . 

Non-preconditioned and preconditioned solution data for the MFIE ma¬ 
trix are given in Fig. 3. Since the MFIE formulation produces better condi¬ 
tioned systems, the residual error behavior of the MFIE is better than that 
of the EFIE. Nevertheless, convergence without preconditioning is very slow. 
When the ILU preconditioner is included we reach convergence down to 10 -5 
in about N /150 iterations. Fig. 4 gives corresponding convergence curves for 
the CFIE matrix. Since the CFIE system is better conditioned, convergence 
is now achieved even without preconditioning. Use of ILU simply improves 
the convergence again from N /25 down to N /150 to reach an error of 10 -5 . 

Table 1 summarizes the performance of the ILU preconditioner for a 
larger problem. The scatterer in this simulation has sharp edges and tips as 
well as smooth sections. Also, the mesh is quite distorted and non-uniform 
around these edges. Nevertheless, the performance of the ILU precondi¬ 
tioner is quite impressive. Specifically, ILU improved the convergence of the 
CFIE (a = 0.5) matrix down to N/10 ,000. leading to a solution time of only 
5 minutes for a 53,000 unknown system on an 8-processor SGI Origin 2000. 

Based on the above performance evaluations, we can conclude that the 
ILU preconditioner can be used to improve the performance of iterative 
solvers in FMM implementations without increasing memory utilization for 
the preconditioner matrix. 
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Number 

of 

Unknowns 

Matrix 

Fill 

Time(min) 
(8 proc.) 

Precond. 

LU 

Time(min) 
(1 proc.) 

Number 

of 

Iter. 

Residual 

Error 

Time 

per 

Solution(min) 
(8 proc.) 

53000 

77 

81 

5 

0.001 

5 


Table 1: Performance of ILU for a large-scale complex target with sharp 
edges an tips on 8 processor SGI Origin 2000. 
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Abstract 

The FastScat(TM) scattering code has been adapted to solve RCS problems involving 
targets comprised of penetrable materials. Like its PEC-only predecessor, this code enjoys 
the simplicity and performance benefits provided by a high-order, point-based discretization 
and a scalable, parallel implementation of a multi-level, fast multipole method solver. 


1 Introduction 

The essential features of the general 3c? scattering problem are contained in the more restricted 
problem of scattering from perfectly conducting (PEC) targets. Methods that are useful for 
solving this problem can often be profitably employed, with straightforward modification, to solve 
the general scattering problem. 

These principles guided the initial development of Fast Scat, a frequency domain, boundary 
integral equation scattering code created over the past several years at HRL Laboratories. By 
restricting the target class to PECs, we could focus on developing and applying algorithmic 
advances to the practice of solving large scale RCS problems. The result was an improved tool 
for computational electromagnetics. The algorithmic advances incorporated into the PEC-only 
version of FastScat include a high-order, point-based (Nystrom) discretization and an efficient, 
scalable parallel implementation of the multi-level FMM (fast multipole method). Previously- 
reported results[l] demonstrate that this code achieved its goal of greatly extending the range 
and improving the accuracy and efficiency of practical scattering calculations involving PECs. 

Motivated by a desire to compute scattering from more realistic targets, we have recently 
rewritten our scattering code to be able to handle penetrable regions in addition to PECs. Al¬ 
though the transition to more general boundary conditions necessitated the addition of some new 
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data structures and involved some code restructuring, the numerical algorithms that form the 
basis of the PEC-only code carried through to the materials code with little or no modification. 
And they are similarly effective. 

This paper is an overview of our method for computing scattering from targets comprised 
of homogeneous, isotropic regions (which includes dielectrics, PECs, impedance surfaces, and 
partially conducting surfaces). We describe the modifications that were needed to make the basic 
algorithms of the PEC-only code work with multiple regions and complex dielectric constants 
and present some preliminar} 7 results obtained with the materials code on simple targets. The 
presentation presupposes familiarity on the part of the reader with the multi-level FMM and 
high-order, point-based discretizations. Introductory presentations of these topics may be found 
in [2], [3], and [4]. Parallel implementation issues are discussed in [1]. 


2 Formulation 

The two main differences between the penetrable material scattering problem and the PEC-only 
problem are: 1) the target may contain multiple volumes with different propagation constants 
and 2) at the interfaces between dielectric volumes it becomes necessary to compute magnetic 
currents 1 (corresponding to tangential electric fields) in addition to the usual electric currents 
(corresponding to tangential magnetic fields). 

One can think of the composite scattering problem as that of several separate scattering 
problems, one for each penetrable volume, that are coupled by means of the boundary conditions 
on the fields at the interfaces between adjoining volumes. The equivalent sources J and M on 
(the inside of) the boundary S of a given volume V are related to the exciting fields E* and H 1 
coming from the interior of V by 

-n x E’ = [tLJ+ (k + I j M (1) 

and 

n x IT = -eLM+ (k + A J, (2) 

where e and p are the dielectric constants of the volume, J = n x H, M = — n x E, n is the unit 
normal pointing into the volume, L and K are integral operators given by 

L = n x J ds' uo (l + WJ G (x, x') • (3) 

and 

il = nx J ds ' V'G(x,x') x, (4) 

Whereas the computed electric currents represent physical currents in the PEC case, both the magnetic and 
electric currents should be regarded merely as equivalent currents at certain interfaces, such as those between 
dielectrics. 
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and the kernel G is defined by 


G(X ’ X ' )S 4^ : r = l x - x 'l 


Note that the propagation wavenumber 


(5) 


k = y/eiiu) ( 6 ) 

is specific to the dielectric properties inside the volume. 

At an interface between two volumes, the relationship between J and M on either side of the 
interface is given by the boundary conditions. For example, at the interface between two dielectric 
volumes, the requirement that the tangential components of E and H be continuous across the 
boundary is equivalent to the statement that J and M must be opposites on the two sides. In 
this case, J = J + = — J - and M = M + = —M - can be determined by integral relations such as 

-n x E 1 ’* = ± (ttLf J+ ('±K ± + Ij M (7) 

and 

g x H’-* = T (eLf M+ (±K* + |) J, (8) 

where the db superscript signifies the volume with which the source, field, or integral operator is 
associated (as written, the + volume is the one into which a surface normal, uniquely defined on 
the boundary, points). There are different ways to combine these equations into a CFIE, We used 


the particular combination 

-n x (E’ ,+ + E’’") = ((^I) + - J+ (K + - K~ + l) M (9) 

n x (&■+ + H‘>-) = - (( eL) + - (el)') M+ (K + - K~ + 1) J (10) 

which reduces to 

-n x (E J ’ + + E*’”) = M (11) 

n x (H t,+ + H 1 ’-) = J (12) 


in the limit as the dielectric properties on the two sides become identical. 

In some other cases, the E and H fields are discontinuous across the boundary (e.g. dielectric 
volumes separated by a partially conducting sheet), in which case the integral relations between 
J and M take a slightly different form. In all cases, however, one needs only to evaluate the L 
and K operators and the excitations (which may be zero) on one or both sides of the boundary 
to determine J and, if needed, M. 
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3 Implementation 

In the PEC-only code, two unknowns are associated with each discretization point on the surface. 
Each unknown corresponds to the amplitude of the electric current J in one of two independent 
surface tangent directions. The discretization points are distributed according to a high-order 
quadrature rule on a high-order representation of the physical scattering surface. 

The method for numerically evaluating and applying the integral operators L and K is different 
depending on whether the operation connects field and source points that are nearby of far apart. 
Near interactions (including self interactions) are computed using locally-corrected quadrature 
rules. Local corrections restore high-order convergence to a quadrature rule in the presence of 
kernel singularities that would otherwise destroy the rule’s high-order behavior. Since they are 
needed only for nearby interactions, the matrix of local corrections is sparse. It can be saved and 
reused. 

Far interactions are computed by means of a multi-level FMM. The FMM makes use of the 
fact that relatively few multipoles are required to accurately compute interactions between well- 
separated collections of sources. By organizing sources into groups whose outgoing and incoming 
fields can be represented as sums of plane waves, one can significantly reduce the operation 
count needed to compute far interactions (e.g., with N interacting sources, the operation count 
is O ( N 3 ^ 2 ) for a single-level FMM, as compared to O (N 2 ) for the direct interaction approach). 
Expanding the fields in terms of plane waves is key to making the FMM an efficient method; 
sparsity of the FMM decomposition depends on the fact that the operation of translating plane 
waves from a source group to a field group does not mix the terms associated with different 
propagation directions. The multi-level FMM obtains an additional speedup by organizing small 
groups (at the lowest level) into larger groups, these into still larger groups, and so on. The net 
result is that fewer plane wave translations are required and the overall operation count decreases 
(asymptotically) to O (iV log 2 ( N )). Storage requirements decrease similarly. 

The transition from a PEC-only code to a materials code required: 

• the addition of magnetic source unknowns at boundaries that can support non-vanishing 
tangential electric fields, 

• computation of two sets of local corrections (one for each volume) at interfaces between 
penetrable volumes, and 

• construction of separate FMM operators for each volume. 

Contrary to what one might expect, it is not necessary to construct separate FMM operators 
for E and H fields. The reason is that, for each plane wave term in the expansion, one field is 
simply the a 90°-rotated version of the other. 

Allowing for complex-valued dielectric constants, and therefore complex values for k, also had 
a negligible impact on the FMM. Its effect on the local corrections was not completely negligible, 
however, insofar as it necessitated that local corrections be computed not just for the real part 
of the kernel (which is singular) but also for its imaginary part (which is regular whenever k is 
real). The singularity in the imaginary part comes from the added factor of exp (—fcr), which has 
a cusp at vanishing separation for non-zero values of the imaginary part of k. 
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4 Results 


As in the PEC case, the canonical target for initial validation of a 3 d scattering code is a sphere, 
because computed solutions can be compared against Mie series solutions. In the materials case, 
the sphere may consist of one or more dielectric shells. Figure 1 compares bistatic cross sections 
obtained from the Mie series against those computed by FastScat for a four-layer sphere consisting 
of a lA-radius IBC (impedance boundary condition) core surrounded by three 0.1 A-thick dielectric 
layers. Each dielectric layer has a partially-conductive coating. The (complex-valued) material 
parameters of the layers are given in Table 1. 

The FastScat computation used a discretization produced by meshing each spherical boundary 
layer into 320 triangular spherical patches and placing a 15-point quadrature rule on each patch. 
The (admittedly over-refined) discretization that resulted had a total of 19200 sample points and 
6T200 unknowns. The RMS error of the computed solutions (as compared to the Mie series) 
are 0.001 dB and 0.005 dB (06). The calculation made use of 3 FMM levels and a CFIE 

formulation. 

The result of another calculation is displayed in Figure 2. In this case the target is a 6Ax3AxlA 
slab of uniform dielectric with index of refraction equal to 2. The graphic shows the intensity 
pattern of the equivalent electric current (i.e., surface magnetic field) for a vertically-polarized 
plane wave normally incident on the 3AxlA face. In this case, the surface was meshed into 1/4- 
A square patches and a 16-point quadrature rule (the product of two 4-point Gauss-Legendre 
product rules) was placed on each patch. The inside and outside FMM operators both had 5 
levels. 

The PEC-only version of FastScat can import high-order geometry descriptions produced 
by a CAD program. We regularly read in meshes generated by PATRAN, which can export a 
mesh composed of bicubic quadrilaterals. We are presently adapting this code to handle CAD 
descriptions of more general targets with material properties. When this task is complete, the new 
version of FastScat will be able to attack a large class of interesting problems involving realistic 
targets. We are also in the process of implementing a Nystrom-discretized, high-order description 
of waveguide apertures and excitations [5]. 


5 Summary 

The FastScat RCS code has been augmented to be able to handle targets comprised of homoge¬ 
neous, isotropic penetrable materials in addition to PECs. The new code draws on the unique and 
powerful methods of its PEC-only predecessor, such as a high-order, point-based discretization 
and a scalable, parallel implementation of the multi-level fast multipole method. Enhancements to 
the code to allow for waveguide apertures and CAD-specified geometry descriptions are presently 
in development. 
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Table 1: Material parameters for 4-layer sphere, rj (surface impedance); e and p (volume dielectric 
constants); a (surface conductivity) 
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Figure 1: Comparison of Mie series and Fast Scat solutions for a lA-radius PEC sphere with three 
0.1 A-thick dielectric coating layers. 



Figure 2: Intensity pattern of the equivalent electric current on a dielectric slab with e = 4, fi = 1. 
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Abstract 

We describe a new approach for accurate and highly efficient numerical simulation of elec¬ 
tromagnetic (EM) fields inside and in the vicinity of complex dielectric objects. Main elements 
of the approach are a novel coupled surface/volume integral-equation formulation and a modular 
solution technique consistent with the sparse block structure of the impedance matrix resulting from 
the proposed equations. The proposed approach utilizes also various impedance matrix compression 
techniques which are applied selectively for each block of the impedance matrix in a manner en¬ 
suring the optimal performance. The method is particularly well suited for simulation of EM field 
distribution in complex biological media. 


1. Introduction 

Numerical simulation of a broad spectrum of electromagnetic processes involving realistic 
dielectric objects encounters, generally, some or all of the following challenges: the geometries are 
several wavelength in size, they are highly inhomogeneous and rich in structural details of the order 
or orders of magnitude smaller than the wavelength. 

For such class of problems, the integral equation based approaches (methods of moments 
(MoM)) are known to produce results of the highest degree of accuracy. The computer memory 
and computational time resources required by MoM have been recently significantly alleviated by 
the progress in the development of the impedance matrix compression techniques. 

However, although a significant progress has been made in the application of compression 
techniques to electromagnetic processes involving simulation of perfectly conducting and resistive 
structures (which can be described in terms of surface integral equations), relatively much less 
attention has been given in the literature to the development and implementation of efficient 
integral equation techniques for electrically large and complex dielectric objects. 

In this paper: 

(1) We describe a novel coupled surface/volume integral-equation formulation suitable for large 
dielectric geometries composed of coexisting homogeneous and inhomogeneous regions. The pure 
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surface integral equations lead to block sparse structure of the impedance matrix however they 
are limited to fully describe models consisting only of piecewise homogeneous regions. The purely 
volumetric integral equations, although capable of capturing the inhomogeneities, lead to a dense 
impedance matrix. The formulation we propose is based on a representation of fields in terms of 
coupled surface currents and residual volumetric currents, and results in a block sparse impedance 
matrix. 

(2) We present an efficient solution strategy achieved through utilization of the sparse block 
structure of the resulting impedance matrix as well as the suitable impedance matrix compression 
techniques. Our solution strategy was designed to be most efficient for geometries characterized by 
a clear hierarchy in the importance of individual components of the solution (i.e. of the induced 
current). One type of geometries which, in a natural way, display a hierarchy of the importance 
of individual components, are anatomical models. Here, because of the absorption in biological 
tissues, the magnitudes of fields and currents gradually decrease with the depth inside the body. 

During the presentation we will show capabilities and efficiency of the proposed method for 
representative electrically large inhomogeneous dielectric objects. 


2. Formulation 

2.1. Coupled surface/volume integral equations 

We first introduce and describe our coupled integral equation formulation on an example (Fig. la) 
of a single material region characterized by a position-dependent relative electric permittivity 
e(r) and a magnetic permeability /i(r), surrounded by the free-space region fi 0 j e o ~ Mo = 1* 



Fig. la: A single material region sur¬ 
rounded by the free-space region Q 0 . 



Fig. lb: A representation of equivalent 
surface and volumetric currents in the 
problem of Fig. la. 
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We denote by e 1 and fa average values of the parameters e(r) and fi(r) in ftj. In Fig. la, S = S 10 
denotes the interface between the regions, and n = n 10 the unit normal vector oriented from 
to On the interface s = n 10 there may be located a thin conducting sheet characterized 
by the (generally position-dependent) electric resistivity i£, magnetic resistivity 5, and a “cross- 
resistivity” W [1]. Such a sheet may be used to model a set of thin layers with various material 
properties. 

We denote volumetric currents in region ft} by [j : im 2 ] T , and introduce two sets of equiva¬ 
lent surface currents on the interface E 10 , [ J 01 iM 01 ] T and [ J 10 iM 10 ] T , as shown in Fig. lb. 
For convenience, we multiply here the magnetic fields and currents by the imaginary unit; this 
convention results in a manifestly symmetric complex impedance matrix. 

The essence of our coupled integral equations is to represent the material parameters in dielec¬ 
tric regions as constant average values (for which the Green’s function is known), and deviations, 
which will appear in the volumetric equations. We thus write, in region f2 m , e(r) = e m + (e(r) — e m ) 
(and similarly for /x(r)), and introduce matrix-valued Green’s functions of the Maxwell’s equations 
for constant parameters e m , /x m , 


ifcoMm (1 + 4: V ® V)3 m (r) i Vg m (r) x 

i Vfl m (r) x i k 0 e m (1 + V ® V)j m (r) 


( 2 . 1 ) 


Here k m = /x m k Q is the wave number in the homogeneous medium, and g m (r) = e lfc « ^ / 47r |r| 
is the corresponding Green’s function of the Helmholtz equation. 

We represent fields in each region as due to the surface currents on the boundary of the region, 
and volumetric currents inside the region; both contributions involve the Green’s function (2.1) (we 
use here the conventional integral representations [2,3,4]). By imposing the Maxwell equations on 
the fields, we obtain then a set of coupled equations for the surface and volumetric currents, 
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and 


[iA 0 («-€ a )] 1 0 

0 [i k 0 (n - ft)] -1 



in Q 1 . (2.2c) 


Here E m and H m cure the external incident fields. The matrices Ci„ 3 C(n 10 ) and T 10 s T{ n 10 ) 
are defined as 


C(n) = 


-R 

iWnx " 

, T(n) m 2 

W 

—iRn x 

—iWn x 

-5 


|^-i5n x 

-W 


(2.3) 


and are, generally, position-dependent. The parameter p = 1/4 (i?5 + W 2 ) characterizes the 
degree of the electric penetrability of the sheet. In Eqs. (2.2) we use a compact operator notation, 
in which 




etc. 

We note that if the region is homogeneous, we have e(r) = e 1 and p(r) = In this case 
the volumetric currents, j x and m l3 must vanish, and Eqs. (2.2) reduce to purely surface equations. 

The physical meaning of the coupled system of equations (2.2) is clear: Eqs. (2.2a) and (2.2b) 
have the form of the usual integral equations for equivalent surface currents, in which the sources on 
the right-hand-side include not only the incident wave, but also fields generated by the volumetric 
currents. We note that our integral equations differ substantially from the conventional volumetric 
integral formulation, which would have the form 


[ifco(e-l)]' 1 0 [ j j 

+ ^0 

0 [iA; 0 (p — l)]" 1 J [imj [in 



( 2 . 6 ) 


The essential differences are: 

(1) The material-dependent term in Eq.(2.2c) involves only differences between the actual pa¬ 
rameters e(r) and p(r) and their average values e-L and fx x in the medium, rather than the 
ffee-space values. 

(2) The Green’s function appearing in Eq.(2.2c) corresponds to a material with parameters and 

rather than to free-space, as in Eq.(2.6). 
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(3) While in Eq.(2.6) the right-hand side represents the incident free-space wave (E m , H m ), in 
Eq.(2.2c) it is given by the field generated by the equivalent surface currents on the boundary 
of flj. 

We note that the currents and m 1 in Eqs. (2.2) are thus not the conventional volumetric currents 
j and m of Eq.(2.6), but rather “residual currents” due only to deviations of the material properties 
e(r) and /x(r) from their average values and fi v 

2.2. Structure of integral equations and the impedance matrix for complex objects 

Equations similar to (2.2) hold for objects with multiple regions and interfaces, such as the example 
of Fig. 2. Similarly to Fig. lb, the interfaces between regions are represented as “doubled”, so that 
each surface supports one pair of currents (J,M). 



Fig. 2: A system of two material regions (fix and Q 2 ) in free-space (ft 0 )» with region boundaries 
supporting surface currents. 

The most essential properties of these equations are as follows: 

(1) Surface currents on a boundary of a region couple only to: 

- Surface currents on the same boundary . Thus, in Fig. 2, currents (J 10 ,M 10 ) couple to 
(J 12 , M 12 ) through the Green’s function G 1 of region fi*. 

- Surface currents on the boundary of adjacent regions. In this way, currents (J 12 , M 12 ) 
couple to (J 21 ,M 21 ) through the material-dependent term C . 

- Volumetric currents in the considered region. Thus, currents (Ji 2 ,M 12 ) couple to (j l7 rrix) 
through (?x. 

(2) Volumetric currents in a region couple only to: 

- Themselves , with the material-dependent term in the volumetric equations (2.2c) and the 
region’s Green’s function. E.g., currents (jjjiiij) couple only to themselves, and no other 
volumetric currents. 
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- Surface currents on the boundary of the region with the Green’s function of the region. 
Thus, (j l5 rrii) couple to (J 10 ,M 10 ) (J 123 M 12 ) with 

(3) Incident fields couple only to the surface currents on the interface with the free-space region. 

The most important of the above features is that the couplings between the currents are local 
- limited to the same region or adjacent regions. 

After Galerkin discretization with locally-supported surface and volumetric basis functions by 
using conventional basis functions [5,6] defined on triangles and tetrahedra, we obtain a matrix 
equation 

Ax = b, (2.7) 

where x represents the unknown currents, b the incident fields, and A is the impedance matrix. 
Since, as we discussed above, there are only couplings of currents on adjacent interfaces and in 
adjacent regions, the matrix A is block sparse. 

The block-sparse structure of the impedance matrix in our formulation contrasts with the 
purely volumetric formulation, which involves mutual couplings of all current elements in the 
entire object, and leads to a dense impedance matrix. 


3. Solution Strategy 

The structure and the block-sparseness of the impedance matrix following from the coupled sur¬ 
face/volume formulation presented above offers interesting possibilities of developing novel ap¬ 
proaches to the impedance matrix compression and to iterative solution of the problem. 

We describe here an iterative solution technique applicable, in particular, to geometries which 
exhibit a clear hierarchy in the importance of the individual components of the solution (i.e., of 
the induced currents). As an example of such geometries, one may consider anatomical models. 
There, due to the absorption in the tissues, the magnitude of the fields and currents gradually 
decreases with the depth inside the body. It is therefore natural to order the components of the 
objects, i.e., its volumetric regions and the interfaces between them, according to the magnitude 
of the currents they support. 

Given the hierarchical ordering of the geometry components, and thus of the blocks of the 
current vector, a straightforward iterative solution method would be to: (i) solve for the currents 
on the first geometry component (in the absence of all the other components); (ii) use the currents 
on the first component as sources of the field incident on the second component, and solve for 
currents on the second component; etc. Then, if the desired accuracy has not been reached, repeat 
the entire process with the computed currents used as the starting point. The heuristic justification 
of the procedure is that, whereas the larger currents strongly affect the smaller currents (on the 
next component), the reverse influence is much weaker. 
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The solution procedure just described is precisely the classical block Gauss-Seidel method, in 
which the new approximation to the solution, x\ is related to the previous approximation x by 

x' = (I-L^A)x + L^b = Kx + Nb i (3.1) 

where L A is the lower-block-triangular part of A ( including its diagonal blocks). 

The iterative solution obtained from Eq.(3.1) is then the series 

z = (I + K + K 2 + K* + ...)Nb , (3.2) 

which may or may not converge, depending on whether or not the spectral radius of the matrix 
K is less than 1. This obvious deficiency of the block G-S method becomes clear when we realize 
that Eq.(3.2) is the Richardson-method solution of the left-preconditioned equation 

L?Ax = L?b, (3.3) 

with L~£ playing the role of the preconditioner matrix. The solution converges only when the 
matrix L~ A A is sufficiently close to the identity matrix. 

This deficiency can be, however, remedied, if we observe that it is associated only with the 
Richardson solution method, and not with the equation (3.3) itself. We can thus solve Eq.(3.3) by 
using any other iterative method whose convergence is guaranteed, such as the GMRES algorithm. 

In practice, it is more convenient to use right-preconditioning. We thus solve the system of 
equations 

A L~ A x — b, x = L^x , (3.4) 

where is the preconditioner matrix. We remark here that the preconditioner in Eq.(3.4) can 
be regarded as the inverse of the very simple approximation to the matrix A, A a L A (which is a 
reasonable approximation under the assumption of the hierarchy of the solution components). 

A possible alternative preconditioner, U A x D A L~ A (where U A and D A are the block-upper- 
triangular part and the block-diagonal part of A ), would result from the incomplete block LU 
decomposition of A, A a L A D^ U A . 

For large problems it would become prohibitively expensive to compute the matrix ex¬ 
plicitly. Thus, we developed the following scheme. We implement the preconditioner-vector mul¬ 
tiplication, say v = L a u, by iteratively solving the system L A v — u] this entails a sequential 
solution of a set of equations with diagonal blocks of A, and forward-substitution steps involving 
the off-diagonal blocks of A. The preconditioned iterative solution of the full system can be thus 
represented as 

x = {£ A }; 1 {A{i A }; 1 }; 1 6, (3.5) 

where v = {A}^ 1 u denotes an iterative solution of the equation Av = u, computed with the 
tolerance A, and similarly for v = u. Thus, 6 and A in Eq.(3.5) are tolerances of the inner 

and outer iterations. 
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We note that since the computational cost of the block preconditioner-vector multiplication, 
v = u associated with Eq. (3.5) is in general higher than the cost of the impedance matrix- 
vector multiplication v = Au, its use is advantageous if it leads to a significant reduction the 
number of outer iterations. 

The application of our block preconditioner is even more advantageous in the out-of-core 
solution, in which individual blocks of the matrix A are stored on disk, and retrieved when needed. 
The gain results precisely from the fact that, for our block preconditioner, the cost of disk operations 
is approximately the same for the matrix-vector and preconditioner-vector multiplications, but the 
CPU time is higher for the latter. Therefore, the relative disk-operations overhead is lower for the 
preconditioned solution. 


4. Numerical Examples 

We have carried out extensive validation tests of the proposed method and were able to obtain 
highly accurate field distributions inside complex dielectric media modeled in terms of several 
hundred thousand of unknowns. For example, a field distribution within a complex anatomical 
model of 250,000 unknowns was obtained, with the residual tolerance of 0.01, in only 3 global 
(outer) iterations. Different impedance matrix compression techniques were used in for different 
regions, depending on their geometrical and electromagnetic characteristics. Compression methods 
were selected automatically by the code on the basis of the estimated matrix-fill and matrix-vector 
multiplication costs. Additional results of our analysis will be presented at the conference. 
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1. Introduction 

We are interested in determining the electromagnetic fields within closed perfectly conducting cavities 
that may contain dielectric or magnetic materials. The vector Helmholtz equation is the appropriate 
partial differential equation for this problem. It is well known that the electromagnetic fields in a 
cavity can be decomposed into distinct modes that oscillate in time at specific resonant frequencies. 
These modes are referred to as eigenmodes, and the frequencies of these modes are referred to as 
eigenfrequencies. Our present application is the analysis of linear accelerator components. These 
components may have a complex geometry; hence numerical methods are required to compute the 
eigenmodes and eigenfrequencies of these components. 

The Implicitly Restarted Amoldi Method (IRAM) is a robust and efficient method for the numerical 
solution of the generalized eigenproblem Ax = ABx, where A and B are sparse matrices, x is an 
eigenvector, and X is an eigenvalue. The IRAM is an iterative method for computing extremal 
eigenvalues; it is an extension of the classic Lanczos method. The mathematical details of the IRAM 
are too sophisticated to describe here; instead we refer the reader to [1]. A FORTRAN subroutine 
library that implements various versions of the IRAM is freely available, both in a serial version 
named ARPACK and parallel version named PARPACK. 

In this paper we discretize the vector Helmholtz equation using 1 st order H(curl) conforming edge 
elements (also known as Nedelec elements). This discretization results in a generalized eigenvalue 
problem which can be solved using the IRAM. The question of so-called spurious modes is discussed, 
and it is shown that applying a spectral transformation completely eliminates these modes, without any 
need for an additional constraint equation. Typically we use the IRAM to compute a small set (n < 30) 
of eigenvalues and eigenmodes for a very large systems (N > 100,000). 

2. Problem Formulation 

We are interested in solving the vector Helmholtz equation in a 3-dimensional inhomogeneous volume 

Q, 


V x }L v V x E - 0 ) 2 sE = 0 in Q ., 


( 1 ) 
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with boundary condition n x E - 0 on dH, where E is the electric field vector, fi and £ are the tensor 
permeability and permittivity, and#is the radian frequency. Employing the Galerkin procedure using 
1 st order edge elements denoted by . results in the generalized eigenvalue problem 


Ae = co 2 Be (2) 

where e is the A^-dimensional vector of degrees-of-freedom and the matrices A and B are given by 

(3) 

a n 

The details of computing the matrices can be found in most finite element textbooks, for example 
[2][3]. Using terminology from continuum mechanics, the matrix A is referred to as the stiffness 
matrix and the matrix B is referred to as the mass matrix. When fi and £ are real, the matrices A and 
B are symmetric; the matrix A is semi-definite and the matrix B is positive-definite. 

3. A Comment on Spurious Modes 

Equation (1) admits to two types of solutions; irrotational field solutions and solenoidal field solutions. 
An irrotational field is the gradient of a scalar potential function 

E ir = -V^. (4) 

Inserting (4) into (1) we see that co = 0 for irrotational fields. Conversely, by taking the divergence of 
(1) we see that if co * 0 then the field must be solenoidal, 

0. (5) 

Since the permittivity may not be continuous, equation (5) is best understood in the weak sense: we 
multiply (5) by a scalar potential <p that is zero on the boundary, and then integrate over the domain Q. 
and employ the divergence theorem to yield 

j£E s = ( 6 ) 

£2 

Equation (6) states that a solenoidal solution of (1) is orthogonal to every irrotational solution. 
Solutions of (1) therefore can be decomposed into irrotational (co = 0) and solenoidal (co & 0) 
solutions, with every solenoidal solution being orthogonal to every irrotational solution. An important 
property of the vector finite element method is that the discrete Helmholtz equation (2) has the same 
decomposition of solutions as the original PDE. Let L h c H x be the set of standard bi-linear nodal 
finite element basis functions, and let the discrete scalar potential be an element of L h . It can be shown 
that the finite element spaces L h and k are related by VL h e W h ; therefore the gradient of every nodal 
basis function can be written as a linear combination of edge functions [4]. Because of this there exists 
exactly K discrete irrotational fields that are gradients of discrete scalar potentials, where K is the 
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number of internal nodes in the mesh. It can also be shown that these discrete irrotational fields form 
the null space of the stiffness matrix A; there are exactly K solutions of (2) with co = 0. These are the 
so-called spurious modes; they are static solutions of (2) with non-zero divergence. They are 
mathematically valid solutions of (2), but they are physically uninteresting. We are interested in the 
solenoidal solutions of (2). As in the continuous case, the discrete solenoidal solutions of (2) are 
orthogonal to the discrete irrotational solutions according to the inner product u T Bv; this is a basic 
property of symmetric generalized eigenvalue problems. 

4. Application of ARPACK for Electromagnetic Eigenvalue Problems 

In this section we discuss the application of ARPACK, which is a specific implementation of the 
IRAM for computing the solenoidal eigenvectors and corresponding eigenvalues of (2). ARPACK 
provides several different subroutines depending upon whether the mass matrix B is the identity or 
not, whether the matrices A and B are symmetric or not, whether A and B are complex valued or not, 
etc. The mass matrix B would reduce to the identity matrix if Cartesian grid finite difference 
discretization of the vector Helmholtz equation was used. The matrices A and B would be complex 
valued and non-Hermitian if // and e were complex valued, representing lossy dielectric and magnetic 
materials. In our case A and B are real symmetric matrices. 

The IRAM is an iterative method for computing a small set of extremal eigenvalues. In ARPACK the 
user can select to compute largest algebraic eigenvalues, smallest algebraic eigenvalues, largest 
absolute value eigenvalues, or smallest absolute value eigenvalues. ARPACK requires 
N • 0(m) + 0(m 2 ) storage where N is the dimension of the system and m the desired number of 
eigenvalues to compute. The basic user-specified parameters are: 

a) N, the dimension of the system 

b) m, the desired number of eigenvalues 

c) WHICH, a character string denoting which eigenvalues to compute 

d) ncv, the number of Lanczos basis vectors to use 

e) tol, the numerical tolerance used to determine convergence 

f) maxit , the maximum number of iterations 

The number of Lanczos vectors must be at least m+ 1. The optimal choice of ncv with respect to m is 
problem dependent and experimentation is required. If the eigenvalues are well separated then 
ncv ~ 2 • m is acceptable, where well separated is defined as 

for all j*i with C»e M . 

The numerical tolerance tol is used in the stopping criteria \X C - X t \ < tol• \X t \ , where 
X c is the computed eigenvalue and X, is the exact eigenvalue closest to X c . It is tempting to make tol a 
very small number, however this increases the run time of the problem. If tol is too small convergence 
may not occur. It is important to remember that the eigenvalue problem (2) is itself an approximation 
to physical reality, so there is little point in computing the eigenvalues of (2) exactly. 
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The parameter maxit specifies the maximum number of iterations, or restarts, of the IRAM. The 
iteration process begins with an initial vector v,, which is usually chosen at random. An Amoldi 
factorization is computed, and the ncv eigenvalues of the ncv x ncv Amoldi matrix are computed. This 
represents one iteration. The salient feature of the IRAM is the ability to automatically repeat this 
process with improved initial vectors v i5 where each new v* is determined by application of a 
polynomial in A to the starting vector v,. The repeated update of the starting vector through implicit 
restarting is designed to enhance the components of this vector in the direction of the desired 
eigenvalues and damp its components in the unwanted directions. The parameters tol and maxit are not 
independent; a large number of iterations may be required to converge with a small tolerance. 
Fortunately neither tol nor maxit affect the required amount of memory, these parameters only affect 
the run time of the calculation. 

ARPACK does not require that users actually provide the matrices A and B; instead all that is required 
is the action of these matrices. Specifically, the user must provide Ax -> w, Bx z, and R ~ l x y . 
For small systems the mass matrix B can be factored, however since we are interested in large 
problems we employ iterative Krylov-type methods for computing B~'x — »y . If iterative methods are 
used for B -I x -» y it is essential that the residual be significantly smaller than the requested 
eigenvalue tolerance tol . 

We are interested in computing only a few of the solenoidal eigenvectors and corresponding 
eigenvalues of (2). Based on the discussion of spurious modes we know that the distribution of the 
eigenvalues is as shown in Figure 1. 



As illustrated in Figure 1, the solenoidal modes are not extremal and the IRAM cannot be directly 
applied to (2). Instead, a shift-and-invert spectral transformation is applied to enhance convergence to 
the desired part of the spectrum. If (x,A) is an eigenpair of (A,B), and a * X , we form a new 
eigensystem 
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Bx = y(A - oB)x 
1 

with y =-—. 

A~G 


( 8 ) 


We choose G to be between 0 and SI, the smallest non-zero eigenvalue. The algebraically largest 
eiganvalues of the transformed system (8) correspond to the eigenvalues imediately to the right of G 
in the original system (2). The zero-valued eigenvalues of the original problem are now the 
algebraically smallest eigenvalues of the transformed system. This is illustrated in Figure 2. The 
IRAM is applied to the transformed system computing the k algebraically largest eigenvalues and 
corresponding eigenvectors. The eigenvectors of (8) and (2) are identical. Once found, the eigenvalues 
of the original problem are computed via 


A- = G H-. 

Yi 


(9) 


N-K eigenvalues with 
soleniodal eigenvectors 



A 



s7 s6 s5 s4 s3 


K eigenvalues with 
irrotational eigenvectors 


s2 


si 


Figure 2: Spectrum of the transformed eigensystem 

The shift-and-invert spectral transformation results in a new eigensystem where the desired 
eigenvalues are very well separated from the spurious eigenvalues. There is no need to add an 
additional constraint equation such as done in [5], nor is there any need to modify ARPACK. A 
disadvantage of this approach is that the user must have some knowledge of the problem to wisely 
choose G. In addition, it is now necessary to provide the action of (A - x —> y. The matrix 
A - dB is indefinite and may be ill conditioned. However it is no more ill conditioned than the 
matrices that arise in standard frequency domain finite element electromagnetics where the frequency 
is a user-specified parameter and the right-hand side of (1) is non-zero. 
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5. Results 

5.1 A Simple Sphere 

The first problem is that of a simple homogeneous sphere. Although this seems trivial, it is in fact a 
difficult problem from a numerical point of view due to the numerous degenerate eigenvalues. The 20 
smallest eigenvalues for a 36 ceil per radius sphere are shown in Table 1, along with the exact 
solution. The sphere had a radius of 0.05855m and the speed of light is unity. The mesh had a total of 
55296 hexahedral cells and the dimension of the eigensystem was N- 162528, the number of internal 
edges in the mesh. A sigma of a = 2000.0 was used to perform the shift-and-invert spectral 
transformation. 


Mode 

Exact 

Computed 

Percent Error 

TMll 

2196.39 

2200.44 

0.184 

TM11 

2196.39 

2200.44 

6.184 

TMll 

2196.39 


0.184 

TM21 

4368.84 

4382.21 

0.306 

TM2I 

4368.84 

4382.21 

0.306 

TM21 

4368.84 

4384.45 

0.357 

TM21 

4368.84 

4384.45 

0.357 

TM21 

4368.84 

4384.45 

0.357 

TE11 

5888.69 

5911.07 

0.380 

TE11 

5888.69 

5911.07 

0.380 

TEil 

5888.69 

5911.07 

0.380 

TM31 

7214.14 

7248.11 

0.470 

TM31 

7214.14 

7248.21 

0.470 

TM31 

7214.14 

7248.21 

0.470 

TM31 

7214.14 

7248.21 

0.470 

TM31 

7214.14 

7252.40 

0.530 

TM31 

7214.14 

7252.40 

0.530 

TM31 

7214.14 

7252.40 

0.530 

TE21 

9688.19 

9731.82 

0.450 

TE21 

9688.19 

9731.82 

0.450 


Table 1; Exact vs computed eigenavlues (co 2 ) for 36 cell per radius sphere 


The data in Table 1 was computed using the following ARPACK input parameters: 

a) N= 162528 

b) 772=20 

c) WHICH = largest algebraic eigenvalues 

d) ncv — 40 

e) fo/=1.0e-5 

j) max it = 300 

Although maxit was set to 300, only six iterations were required for convergence of all 20 eigenvalues. 
The Jacobi-preconditioned conjugate residual method was used to evaluate (A - oB)" 1 * y with a 
residual tolerance of 1.0e-9. The total run time was 25 hours on a Compaq AlphaServer 8400 with a 
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theoretical peak performance of 880 Mflops. The run time was dominated by the cost of the conjugate 
residual method and not by ARPACK or by the calculation of the finite element mass and stiffness 
matrices. 

5.2 A Linear Accelerator Induction Cell 

The second problem is to compute the lowest eigenmodes of a linear accelerator induction cell. Of 
particular interest is the magnitude of the electric field in the accelerating gap, as this determines 
whether or not the particular mode will couple with the electron beam. A 33024 cell hexahedral mesh 
is used to model the induction cell. Part of the cell is vacuum, and another part consists of oil with a 
relative permittivity of s r -45. The input parameters to ARPACK were the same as for the sphere 
problem above except that N = 90237 for this problem. Based on back-of-the-envelope estimations, a 
sigma of a — 0.001 was chosen for the shift-and-invert spectral transformation. A total of 5 ERAM 
iterations were required for convergence of all 20 eigenmodes. The total run time was 15 hours on the 
same Compaq AlphaServer 8400. The computed eigenvalues are shown in Table 2, naturally for this 
problem there is no analytical solution for comparison. The column labeled co 2 is the computed 
eigenvalues of (2) using unity speed of light, and the column labeled Mhz is the calibrated resonant 
frequencies of the induction cell. 

Figure 3 shows the 1 st , 5 th , 13 th , and 20 th eigenmodes of the induction cell. The figures show electric 
field magnitude. Although it may be difficult to discern from the figures, the 1 st and 20 th modes have 
maximum field values in the accelerating gap and hence will couple strongly with the electron beam, 
whereas the 5 th and 13 th modes are examples of modes that will not couple strongly with the beam. 


co 2 

Mhz 

.00495 

132.200 

.01158 

202.166 

.01158 

202.166 

.01721 

246.471 

.02222 

280.015 

.02222 


.02930 

321.560 

.02930 

321.560 

.03585 

355.683 

.03585 

355.683 

.05206 

428.607 

.05206 

428.607 

.05466 

439.197 

.05917 

456.955 

.05921 


.05929 

457.417 

.05929 

457.417 


498.881 

.07053 

498.881 

.08034 

532.453 


Table 2 : The 20 lowest eigenvalues and resonant frequencies for the induction cell 
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Figure 3: Selected eigenmodes for the linear accelerator induction cell. The geometry is clipped 
so that the internal structure of the modes can be seen. 
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Fast Fourier Transform of Functions with Jump Discontinuities 
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Abstract — A fast algorithm is presented for the evaluation of the Fourier transform 
of piecewise smooth functions with uniformly and non-uniformly sampled data by using a 
double interpolation procedure. The complexity of algorithm is 0(Np + iVTog(jV)) where 
p is the interpolation order. Numerical experiments demonstrate the performance of the 
algorithm. 


I. Introduction 

The Fast Fourier Transform (FFT) algorithm, as one of most fundamental and pow¬ 
erful mathematical tools, has been widely used in computational science and engineering 
in general, and computational electromagnetics and signal and image processing in partic¬ 
ular. As is well known, for a smooth periodic function, the FFT provides a high accuracy. 
However, if the function has singularities, such as jump discontinuities, the accuracy of the 
FFT results will be greatly reduced. It is easy to understand that this error is a result of 
the trapezoidal rule in the discretization of the Fourier transform integration. Sorets pro¬ 
posed a Fast Fourier Transform algorithm of piecewise constant functions via the Gaussian 
quadrature and Lagrange interpolation, together with an application of the standard FFT 
[1]. Beylkin also addresses the FFT of functions with similar singularities using piecewise 
polynomial splines [2]. 

In this paper, we develop an algorithm for piecewise smooth functions by using a 
double interpolation procedure. With the help of the double interpolation and Gaussian 
quadrature, the algorithm can be applied to both uniform and nonuniform sampled data. 
Hence, this algorithm can also perform nonuniform fast Fourier transform for unequally 
spaced data [2-6]. In the following sections, the formulation of algorithm is developed, 
followed by the implementation procedures and complexity analysis. Finally, we show the 
numerical results to demonstrate the performance of the algorithm. 


This work was supported by the U.S. EPA through a PECASE grant CR-825-225-010, 
and by the NSF through a CAREER grant ECS-9702195. 
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II. Formulation 


Consider the Fourier transform of a piecewise smooth function f(x) 

1 

/(n) = J f(x)e~ i2ltnx dx —N/2 <n< N/2 - 1 (1) 

0 


where n is an integer. We divide the integration interval [0,1] into L subdivisions in 
which f(x) is a smooth function. Step discontinuities are allowed at the boundaries of 
these subdivisions. In each subdivision, there are at most p uniformly or non-uniformly 
sampling points, where p is referred to as the order of interpolation as shown later. Then 
(1) can be written as 


/(») = j /(*)< 


: dx 


( 2 ) 


where {xi}(l = 1 ,••*,£) are the end points of the subdivisions. Using the change of 
variables 


t t Xi+Xi- 1 Xi-Xi- 1 

t = a + ox =---1--- x 


and the Gaussian Legendre quadrature 



Q 

dt -^2 /(**) u k 

k~\ 


( 3 ) 

( 4 ) 


where £*. and Wk(k = 1 , • • • , q) are the Gaussian-Legendre nodes and weights, respectively, 
we obtain 


/(«)»£* (5) 

{=1 n =1 

In the above expression, the values of both function f(x) and the exponential function 
at the nonumform distributed points {t l k } can be obtained via the Lagrange interpolation, 


where 


g(x) — gip^m) ^m( 2 ') 


S m{x) = II 


X m - x n 


( 6 ) 

( 7 ) 
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Here we perform this interpolation separately for f(x) with order pi and exp(— ttirnx) 
with order p. 

Note that we use this double interpolation scheme because these two functions to be 
interpolated have different properties. The former is a piecewise function, while the latter 
is a periodic analytic function but highly oscillating when n becomes large. Therefore we 
normally interpolate the two functions with different sets of sampling points and orders of 
interpolation. For the exponential function, we use a uniform grid in order to apply the 
uniform FFT for the discrete summation; and we always choose the interpolated points to 
locate at the center of the interpolation region to obtain the highest accuracy for a p-th 
order Lagrange interpolation. With this double interpolation, (6) becomes 


1=1 k =1 




mi=1 




( 8 ) 


Transforming the local coordinate to global coordinate and changing the order of 
summation, we finally obtain 

uN 

/w = E^ e " i2,rnXm w 

m=l 


where 

9m = bl ( 5Z /(4n) 5 >m(4) ) S m(t‘k) (10) 

1 =1 k=l \mi-l / 

and v is the oversampling factor. Since {x m } are now on a uniform grid in [0,1], (9) can 
be efficiently evaluated via a standard uniform FFT algorithm. 

It is worth pointing out that the above double interpolation procedure has several 
advantages. First, the double interpolation can ensure the interpolation points for the 
exponential function to locate at the center of the interpolation region, leading to the 
highest accuracy for a given Lagrange interpolation order. In contrast, if a single inter¬ 
polation were applied to the product of these two functions, the periodic property of the 
exponential function could not be used because of the piecewise property of the function 
to be transformed. Furthermore, the locations of the interpolated points would not be 
always located at the center of the interpolation region. The accuracy of the interpolation 
for the product, hence the accuracy of the FT, would be reduced, especially when n be¬ 
comes large. Second, since the Lagrange interpolation allows the nonuniformly sampling, 
the above algorithm can be applied to both uniform and nonuniform sampled data with 
the same number of arithmetic operations. In contrast, a single interpolation allows only 
uniform sampled data in order to use uniform FFT to evaluate the summation. Third, 
the double interpolation allows a lower order interpolation, i.e., a lower sample density, for 
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the slowly varying function f(x). Finally, the double interpolation allows other efficient 
algorithms, such as the fast multipole method (FMM) [7], to be incorporated readily into 
the interpolation procedure. 

Implementation of the algorithm includes following steps: 

1) Initialization of 6 mi (tj.) and S m (t l k ). This preprocessing is needed only once and 
has a complexity of O(Npi) and 0(Np 2 ) respectively, taking q = p; 

2) Calculation of g m in (10). The complexity is O(Np): 

3) Calculation of f(n) in (9) by a standard FFT. The complexity is O (TVlog(TV)). 
The total complexity is 0(Np + TVlog(TV)) for steps 3 and 4. The preprocessing step 

needs to be done only once. 


III. Numerical Results 

The algorithm described above has been implemented using Fortran and matlab pro¬ 
grams. To illustrate the accuracy of the algorithm, we take two functions as examples. The 
functions to be transformed in example 1 and 2, as shown in Figs. 1 and 2, are respectively 
defined as 


! x — X\ 
X2-X1 
0 


and 


h{x) = 


a + sm(27tpx) 


0 


X\ < X < X 2 
elsewhere 

Xj ^ X ^ x 2 
elsewhere 


(ii) 


( 12 ) 


In the calculations, we take v = 4, p = 32, and p\ = 2 (Fig. 1) or pi = 16 (Fig. 2). 
Both uniformly and nonuniformly sampled data are used in each example. The nonuni- 
formly spaced points are randomly distributed. We compare the numerical results with 
the analytical solutions. The error is defined as 




max \f{n)-f a (n)\ 

-N/2<n<N/2-l Wl 


(13) 


where / a (n) is the analytical solution. The difference between the results for the uniformly 
and nonuniformly sampled data is negligible. Table 1-3 show the accuracy and computer 
time of the algorithms. The CPU times is obtained on a SUN Ultra 1 workstation. For 
comparison, we also give the results of a direct application of FFT for a uniform grid with 
size TV. 
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Finally, similar to a standard 2-D FFT algorithm, we calculate an example of 2-D FFT 
by the row-column deposition along with the above 1-D FFT algorithm. The function to 
be transformed in example 3 is 


/(*,») = h (*)Mv) (14) 

where fi and / 2 are defined in (12) and (13). In this example, the precalculations in both 
x- and y-direction 1-D FFTs are needed only once. Table 4 compares the accuracy of our 
algorithm with those obtained by a standard 2-D FFT. 

In all examples, we note that with a comparable CPU time (excluding preprocessing 
which is needed only once), our algorithm is 10 order of magnitude more accurate than 
the standard FFT for double precision computations. 

IV. Conclusion 

In conclusion, we presented a fast algorithm for the evaluation of Fourier transform of 
piecewise smooth functions. By using a double interpolation procedure, the algorithm can 
be applied to both uniform and nonuniform sampled data. The complexity of algorithm is 
0(Np + Nlog(N)) plus 0(Np 2 ) for precalculation. The numerical experiments show the 
accuracy and efficiency of the algorithm. This algorithm is useful for signal processing and 
for simulations. 
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X 


Figure 1. A triangle function for 0.25 < x < 0.5. 



x 


Figure 2. A sinusoidal function for 0.2 < x < 0.8. 
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Table 1. Errors and run times for example 1 (double precision) 


N 

Errors (Eoo) 

Timings (sec) 



Direct 





64 


9.683e-03 

0.198 

0.006 


0.002 

128 



0.346 



M ■ 

256 

1.130e-13 

2.408e-03 

0.768 

0.024 

0.037 

0.008 

512 

1.120e-13 


1.535 

0.049 


0.017 


Table 2. Errors and run times for example 1 (single precision) 


N 

: Errors (Eoo) 

Timings (sec) 


This paper 

Direct 

Init. 

Eval. 

FFT 

Direct 

64 

1.856e-07 

9.683e-03 

EMI 

0.002 



128 

6.038e-07 

4.824e-03 

mm 

0.004 

0.016 

H 

256 

6.747e-07 

2.408e-03 

0.104 

0.007 

0.036 

0.008 

512 

7.058e-07 1 

1.203e-03 

EES! 





Table 3. Errors and run times for example 2 (double precision) 



Errors (Eoo) 

Timings (sec) | 


Direct 

Init. 

Eval. 

FFT 

Direct 

512 

1024 

1.471e-ll 

1.080e-12 

5.820e-02 

2.920e-02 

0.591 

1.164 

mm 

<tll 

m 


Table 4. Errors for the 2-D problem in example 3 (double precision) 



Errors (Eoo) \ 


Direct 

128 x 512 

256 x1024 

2.9l6e-12 

2.700e-13 

1.697e-03 

8.511e-04 
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Applications of Nonuniform Fast Transform Algorithms in Numerical Solutions 
of Integral Equations 
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Abstract 

We review our recent efforts to apply the nonuniform fast Fourier transform (NUFFT) and fast Hankel 
transform (NUFHT) algorithms to numerical solutions of electromagnetic integral equations in frequency 
domains. The NUFFT is a fast algorithm to perform the discrete Fourier transform of data sampled nonuni- 
formly with 0(N log 2 N) arithmetic operations. Using the NUFFT scheme, we develop the nonuniform fast 
Hankel transform (NUFHT) algorithm. These NUFFT and NUFHT algorithms are then combined with the 
conjugate-gradient method to solve integral equations in Cartesian and cylindrical coordinates. 


I. Introduction 

In this work, we review our recent efforts to develop numerical solutions of Maxwell’s equations using 
the nonuniform fast Fourier transform (NUFFT) and nonuniform fast Hankel transform algorithms. 

The conventional method for solving integral equations in electromagnetics is the method of moment 
(MOM) [1], Although effective for small-scale problems, this method requires 0{N 3 ) arithmetic operations, 
where N is the number of unknowns, because of the need to invert a dense matrix. It is therefore impractical 
to use the MOM for large-scale problems. 

To reduce the computational complexity of integral equation methods, the conjugate-gradient fast 
Fourier transform (CG-FFT) method has been proposed which combines the iterative CG procedure with the 
FFT algorithm (see [2, 3] and the references therein). It results in a scheme with 0(KN log 2 N) arithmetic 
operations, where K is the required number of CG iterations. Furthermore, there is no need to store the 
MOM matrix since the matrix-vector multiply is carried out implicitly in the convolution calculation using 
FFT. The memory requirement is only O(iV), much smaller than 0(N 2 ) required by the MOM. For cylindri¬ 
cal coordinates, a similar method, called the CG-FFHT (conjugate-gradient fast Fourier Hankel transform) 
method [4, 5], has been proposed. 

However, because of the limitations of the available fast transform algorithms, these methods require the 
grid points to distribute uniformly in certain scales. Specifically, because of the FFT algorithm, the CG-FFT 
method require a uniform distribution of the grid points; The CG-FFHT method requires the grid points to 
distribute logarithmically uniform in the radial direction because of a coordinate transformation involving 
an exponential function in the FHT algorithm. This poses an undesirable limitation to the accuracy and 
flexibility of the spectral domain methods. For example, for problems with both electrically large and small 
regions, it is more efficient to sample nonuniformly so that a higher resolution is achieved in the electrically 
small region. This is not possible unless fast Fourier and Hankel transforms can be achieved for nonuniformly 
sampled data. 

Recently, the nonuniform fast Fourier transform algorithm have attracted considerable attention because 
of its importance in many applications [6-10]. With the NUFFT and the related nonuniform fast Hankel 
transform (NUFHT) algorithms [11], this work aims to remove this important limitation of a uniform grid 
in the spectral domain methods for integral equations. 
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II. Formulation 


The objective of this section is to develop the formulation for spectral domain methods using the 
nonuniform fast transform algorithms. The NUFFT algorithm provides a fundamental tool to nonuniform 
fast Hankel transform algorithm. These algorithms are then incorporated in the CG-FFT and CG-FFHT 
methods for integral equations. 

A. The NUFFT and NUFHT Algorithms 

NUFFT Algorithms. We first define a nonuniform discrete Fourier transform (NUDFT-1, denoted 
by T{) of {a*} 

N/ 2-1 

/,*{*[«]},« E j = -N/2,-“,N/2 -1 (1) 

k=-N/2 

where the frequency samples {u)j = 2 xj/N] are uniform, but the time samples {t k } E [—jV/2,lV/2] are 
nonuniform. A related nonuniform discrete Fourier transform (NUDFT-2, denoted by F 2 ) of {P k } is defined 
as 

AT/2—1 

9i = {F 2 \P))j = E i = -N/2,---,N/2 — 1 (2) 

k=-N/2 

where the frequency samples {tjj} € [—7r, 7r] are nonuniform but the time samples {£* = are uniform. The 
input data {a*} and {/?*} are the data at time sample points {4}. Note that the regular FFT algorithms 
do not apply to these DFT’s. A direct summation of (1) and (2) requires 0(N 2 ) operations. 

In [6-TO] fast algorithms with 0(7Vlog 2 N ) arithmetic operations have been developed for these DFT’s. 
In particular, in [8, 10] we use a least-squares method to interpolate an exponential function at a set of 
nonuniform points with another set of oversampled (q + 1) uniform points. Using this method, for example, 
(1) is rewritten as 


N/2-1 ?/ 2 

fjSj= E Q * E xi(t k )e a ^ mt ^ +e » m > N , j = —N/2,■ ■■,N/2 — 1 (3) 

k=-N/ 2 t=-q/2 

where Sj = cos(jV/miV) is called the accuracy factor, m is the oversampling rate (usually equal to 2), [m£*] 
denotes the nearest integer of mtk, and X((tk) are the interpolation coefficients. Using the least-squares 
method we find that the interpolation coefficients array of length (q + 1) can be written as = F~ 1 a(tk) } 

where array a and matrix F are in closed forms. More importantly, F — F(m, N,q) is independent of £*, 
a remarkable property that gives an efficient interpolation since F of dimension (q + 1) x (q 4-1) and its 
inverse only need to be calculated once, rather than N times. Once the interpolation coefficients are known, 
the DFT of the interpolated data on the uniform points can be performed efficiently by the regular FFT 
algorithm. 

The inverse DFT’s, i.e. to find {a*} and {fi k } from {fj} and {£,-} respectively, cost 0(N Z ) operations if 
the direct matrix inversion is performed. Instead, noting that the product of the matrix = [expfz^tUj)] 
in (1) and its complex-conjugated transpose produces a Toeplitz matrix, we use a CG-FFT method for the 
inversion which costs only 0(KN log 2 N) operations [6, 9]. We refer to these inverse algorithms as NUIFFT-1 
and NUIFFT-2, and denote them as T{ 1 and F^ 1 respectively. In this work, we employ the fast algorithms 
in [8-10] for these DFT’s which are optimal with the least-squares errors. They use the regular Fourier 
matrices for the interpolation of nonuniform data. 
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NUFHT Algorithm. The Hankel transform of a function f(p) is defined as 


/(*,) = j f(p)J n {k p p)pdp (4) 

0 

where J n is the Bessel function of order n. Making the change of variables p = pe ax and k p — ke~ ax , we 
can write (4) as a convolution. Using the convolution theorem, we then obtain the Fourier transform of 
fcexp(— ax)f(ke~ ax ) as 

H(u>) = H n (w)F(w) (5) 

where F(u) is the Fourier transform of pe ax f(pe ax ), and H n (u) is a closed-form Fourier transform of function 
afcpexp(ax)«7 n [fcpexp(ax)]. In the discrete form, if {x m } (m = —Nj 2,*•*, N/2 — 1) are nonuniform, we can 
find F(u)j) , thus at some imiform {uj} using the NUFFT -1 algorithm. Finally, from the definition 

of the NUDFT-1, the nonuniform Hankel transform is obtained as 

p&Xj 

i%i) s {Wn[/(p t )]},- = {^r 1 [#(“>*)]},- J = -N/2,-,N/ 2-1 (6) 

Alternatively, equation ( 6 ) can be replaced by an NUFFT-2 algorithm (see the following discussions on 
convolution for more details). The inverse Hankel transform can be obtained in the same way since its 
definition is identical to (4). The number of arithmetic operations is 0(N log 2 TV) [11]. 

B. Convolution on a Nonuniform Grid 

We now discuss the fast convolution on a nonuniform grid. To approximate this continuous operation, 
we assume that the input functions are band-limited and the truncation error is negligible. 

We consider the convolution of two functions g(x ) and /(x) with the desired results at a set of N 
nonuniform points {a;*} € [x m in = -IV/2,x m ax = N/2]: 

h(x k ) = {$(*) ® f{x)}(x k ) (7) 

where f(x) is assumed zero outside [xmiruZmax], while p(x) is the convolution kernel and may have an infinite 
support. In order to use the convolution theorem, we pad {/(a:*)} with N/2 zeros on both ends; The kernel 
function g(x) is calculated with 2 N points, possibly all nonzero. With a slight abuse of notations, we denote 
these two finite arrays of length 2 N by / and g , respectively. 

Now, using the convolution theorem, the NUDFT’s of h, g and / are related by 


N-i 

H(u j ) = G{u j )F(u j )= £ Az t h(Xk)e ix '-“ 1 , i = -N,- N - 1 ( 8 ) 

k=—N 

where x* e [~N,N], Uj = (; + l/ 2 ) 7 r/iV, and G and F are the NUDFT’s of { 0 (xfc)Ax*} and {/(x*)Ax fc }, 
respectively, with a length of 27V. Note G and F are defined in the same way as the right-hand side of ( 8 ). 
Therefore, the convolution result is approximated by 

f e -i*fcir/2N 'I 

Kx k ) » | ■ Fr 1 Wl J , k= -N/2, • • •, N/2 - 1 (9) 

where JFf 1 [H] stands for the NUIFFT-1 with a length of 2TV. 
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Alternatively, the convolution can also be calculated by using the NUFFT-2 algorithm instead of the 
NUIFFT-1 algorithm. Note that the continuous inverse Fourier transform is 

<x> 

h(x) = ■— J 1 ?(w)e-"*dw. (10) 

—CO 


Approximating this integral by the trapezoidal rule, we have 

p-iXkir/ZN 

{ftri)') t , k=-N/2,--,N/2-l ( 11 ) 

where Ti stands for the NUFFT-2 of length 2 N, and the asterisk denotes the complex conjugation. 

Thus, we have developed an efficient method to compute the convolution on a nonuniform grid with 
0(N log 2 N ) arithmetic operations. Note that in the above formulation, for convenience we have assigned 
x to a special range. In practice, x can take any range which can be easily transformed to the above range 
with a linear coordinate transformation. 

B. The CG-NUFFT Method 

We now combine the CG method and the NUFFT algorithm to solve integral equations in electromag¬ 
netic wave scattering problems. For simplicity, consider a 2-D TM- plane wave incident on an isotropic, 
inhomogeneous cylinder S(x,y ) with electric permittivity e(x,y), electric conductivity cr(x,y), and magnetic 
permeability p — po. From Maxwell’s equations we can derive the integral equation for the electric field 

E z {x,y)+u?p o J [c(a:',z/) - e b \g{x -x',y- y , )E s (x t ,y , )dx'dy' = E™ (12) 

5 

where e = t + ia fu and e b — are the complex permittivity of the cylinder and background medium, 

respectively, Fl nc is the incident field in the background medium, and S is the support of the scatterer. Note 
that the integral in (12) is convolutional. Hence, by using the convolution theorem we arrive at an operator 
equation 

CE z (x,y)=E? c ( Xl y) (13) 

where 

£[•] = [•] + F-HFluWi - h)( ■)] • (14) 

Here, T and T~ l stand for forward and inverse Fourier transforms in two dimensions. 

Similar to the CG-FFT method (see [ 2 , 3] and references therein), the CG-NUFFT method solves 
equation (13) iteratively using the conjugate-gradient method. Within each CG iteration, the forward and 
inverse NUDFT’s are obtained efficiently by the NUFFT algorithms with 0(7VTog 2 N) arithmetic opera¬ 
tions. Therefore the CG-NUFFT retains the computational efficiency of the CG-FFT method, but gives the 
important flexibility of nonuniform sampling. 

C. The CG-NUFFHT Method 

Similarly we can formulate the integral equation in cylindrical coordinates for axisymmetric problems. 
For a circular loop antenna located at {po,zq) in an axisymmetric medium with permittivity t{p,z), perme¬ 
ability p = po, and conductivity c(p,z), the integral equation can be derived as 

oo oo 

Ej(p, z) = Ef c + J dz J dp'g(p , p\ z - z l ) Afc 2 (p', z')E<t>{p‘ z ') (15) 

—oo o 
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where A k 2 = w 2 po(£ — €b) is the contrast function, and g is the Green’s function for the axisymmetric 
problem in cylindrical coordinates. Using the convolution theorem in 2 and Hankel transform in p, this 
integral equation can be written in an operator form [4, 5] 

£[£*(p,z)] = i?f(p,^ (16) 

where 

£[•] = [•] - r-m? { (j^jgn g) •*k»k* - *8) (•))} (i7) 

and Hi denotes the Hankel transform of order 1 in p direction, while T stands for the Fourier transform in 
z direction. These transforms are obtained by the NUFHT algorithm and NUFFT algorithms, respectively. 
Then the operator equation (16) can be solved by the CG method. In each iteration of the CG method, 
NUFHT and NUFFT algorithms perform the operation in (IT) with 0(A r log 2 N) arithmetic operations. 

III. Numerical Results 

We now present some numerical results for the above methods. Figure 1 shows the convolution of a 
function f(x) = cos(x) exp[-36(z-2.5) 2 /I/ 2 ] with another function g(x) = exp(—|x—2.5|) on the nonuniform 
grid within L = 11.4, —3.2 < x < 8.2. The function is sampled by N — 64 nonuniform points with the cell 
size Ax shown in Figure 1(a). The ratio between maximum and minimum Ax is Aa) ma x/Ax m i n = 40.7. 

We have applied the CG-NUFFT method to solve 2-D scattering problems. A square cylinder (e r = 2.56) 
in Figure 2(a) is coated with a very thin layer of dielectric (e T = 4). With a plane TM~ wave incident from 
the left, the induced electric current density calculated by the CG-NUFFT method is compared with the 
MOM results in Figures 2(b) and 2(c), respectively across the center lines in x and y directions. The radar 
scattering width is shown in Figure 2(d). For this problem, the number of unknowns is N x 'x N y = 48 x 48 
for both methods. The nonuniform grid is set up such that there are two grid points inside the thin coating, 
and larger cell size is used outside the coating. The CG-NUFFT method requires much smaller computer 
memory than the MOM. 

Finally we show an application of the CG-NUFFHT method for electromagnetic induction well logging. 
The axisymmetric, inhomogeneous medium is shown in Figure 3(a), where an induction tool consisting of two 
coaxial coil antennas, one as transmitter and the other as receiver, moves along the borehole axis. Because 
of the presence of the thin bed and the slightly different invasion radii in the two thick beds, a nonuniform 
grid is necessary both in z and p directions. Figure 3(b) shows the CG-NUFFHT results compared with the 
numerical mode-matching [12] results. If a uniform grid with the same cell size as in the thick beds is used, 
the CG-FFHT result will miss the thin bed, as clearly shown in the figure. 

Conclusions 

With the recent progress in the development of nonuniform fast Fourier transform and Hankel transform 
algorithms, it is now possible to use a nonuniform grid in an iterative solution of integral equations in 
electromagnetics. Here we have reviewed the applications of these nonuniform fast transform algorithms 
in the CG-NUFFT and CG-NUFFHT for two-dimensional integral equations in Cartesian and cylindrical 
coordinates. Compared to the conventional methods with a uniform grid, these spectral domain methods 
provide the important flexibility in modeling media with both electrically large and electrically small regions. 
Furthermore, we believe that the nonuniform grid can handle Gibbs phenomenon much better than a uniform 
grid. Much further investigation is needed in this regard to remove or reduce the Gibbs phenomenon 
associated with spectral domain methods. 
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(a) (b) 




Figure 1. The convolution of function f(x) cos(s) exp[-36 {x - 2.5) 2 /L 2 ] and g(x ) = exp(—|x - 2.5|) at a 
nonuniform grid through the NUFFT algorithm, (a) Cell size Ax as a function of the position, (b) The 
convolution on the nonuniform grid. 
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Figure 2. The scattering of a square cylinder coated with a thin dielectric layer, (a) Geometry, (b) The 
induced electric current density along the center lines in x and y directions compared with the MOM results, 
(c) The radar scattering width from the CG-NUFFT and the MOM. 
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Figure 3. Induction measurement in an axisymmetric medium with a thin bed. (a) Geometry, (b) Cell 
size Az in the vertical direction, (c) Comparison of apparent conductivity calculated by CG-NUFFHT, 
CG-FFHTj and NMM methods. 
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Abstract 

We consider a divergence-free non-dissipative fourth-order explicit staggered 
finite difference scheme for the hyperbolic Maxwell’s equations. Special one-sided 
difference operators are derived in order to implement the scheme near metal 
boundaries and dielectric interfaces. Numerical results show the scheme is long¬ 
time stable, and is fourth-order convergent over complex domains that include di¬ 
electric interfaces and perfectly conducting surfaces. We also examine the scheme’s 
behavior near metal surfaces that are not aliigned with the grid axes, and compare 
its accuracy to that obtained by the Yee scheme. 


1 Maxwell Equations 

We consider the two dimensional case with J = 0. The only sources for the problem 
are incident waves. These waves will be scattered after they encounter an obstacle. 
Furthermore, in free space e and fi are constants. The extension of the method to 
three space dimensions, sources and variable coefficients is straightforward. Maxwell’s 
equations now decouples into two independent sets of equations. We shall consider the 
TM system. Let r = ct = t/y/JIe and Z = where e and ^ are the permittivity and 
permeability coefficients, in free space, respectively and c is the speed of light. The TM 
wave equations then become: 

dE, 7( dH y dH x 
dr 1 dx 8y } 
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2 Finite Differences Scheme 


We approximate the spatial derivatives to fourth order, using the following explicit 
method for In¬ 
explicit (2,4): 


dU 

dy ij+i/2 


; (Uij-i - 27 U i}j + 27 U itj+l - U id+2 ) 


We shall refer to 2 as the explicit(2,4) scheme. At the first and last points we use a 
fourth order and fifth order accurate one sided approximation to the derivative. We 
note that this is used only in order to globally approximate the derivative. No physical 
boundary conditions are included at this stage. The approximations are as follows: 


dy i, 1/2 24A y 


(-22^,0 + 17tt f i + 9 U it2 - 5f/ i)3 + U iA 


~ 24Ay (~ 23Ui > 1 / 2 + 2lU W + 3£/ W2 ~ U i,Vi) 

= 24A^ ( 2 ^^»^- x /2 “ 21 £fj,jv_3/2 — SUi,N-5/2 + U^N- 7 / 2 ) 

= 0 r r (22^ jy — 1717^,;v_i - 9Ui N-2 + 5C/j jv_ 3 £^- 4 ) 

t,jv—1/2 24A </ 


-23 21 3 -1 . 

1 -27 27 -1 . 

0 1 -27 27 -1 


1 -27 27 -1 

1 -3 -21 23 
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—22 17 9 —5 1 . O' 

1 -27 27 -1 .0 

0 1 -27 27 -1 0 

0 . 1 -27 27 -1 

0 .-1 5 -9 -17 22 _ 

The matrix form of The TM equations is: 

[EZ^ = [EZ id } n + £A H [HY i+ i/ 2 jJ n+1/2 - ^rHX iJ+1/2 ] n+1 / 2 A‘ H 
{HX itj+l/2 ] n+l ' 2 
m + i/2jr i/2 


= [HX iJ+1/2 r^ - —[£2y]“A ‘ e 

= [i?y i+1/2j -]"- 1/2 - ^A E [EZij] n 



3 Time Step 

For each numerical method one must choose a time step for the numerical integration. 
This time step is based on two considerations: stability and accuracy. Since all the 
schemes have stability limits this places an upper bound on the usable time step. 

Since we wish higher order accuracy it is also necessary to limit the time by accuracy 
requirements. We do not want the accuracy of the scheme to be determined by the time 
integration. Hence, the temporal error should be equal to or less than the spatial error. 
For the explicit(2,4) scheme the time step chosen depends on the accuracy desired. As 
the mesh is refined the spatial error decreases as a fourth order scheme and so decreases 
faster than the temporal error. Thus, the time step should depend on (Ax) 2 . If the 
error requirements are too severe then this is inefficient and the leapfrog in time should 
be replaced by a fourth order Runge-Kutta method. However, for the experiments in 
this paper we shall use the same leapfrog method for this scheme. 
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4 Computational Results 


We wish to compare the efficiency of three different schemes: the standard Yee scheme 
([ 1 ]), the explicit( 2 ,4) scheme and the Ty(2,4) ([ 2 ]) scheme advanced in time by a leapfrog 
method. For conciseness of notation we shall refer to these schemes as Yee, and ex¬ 
plicit ( 2 ,4) and the Ty(2,4) respectively. For all computations we choose Z = 1 unless 
it’s stated otherwise. In some of the cases we used the Berenger PML layer in the far 
field or Gedney’s PML method, where it was needed. In all cases the error is measured 
between the exact E z and the numerical E z in L 2 norm. 

We first consider the simplest mode of propagation in a rectangular cross section 
wave guide. We take the walls to be perfect conductors. For simplicity, we assume that 
the permeability is everywhere 1. We take the followin boundary and initial conditions: 


E z {x,y, 0) 
At 

Hy(x, y,—) 

H x (x, y,~) 
Ez(0, y,t) 
E z ( l, y,t) 
E z (x, 0, t) 
E z (x,l,t) 


sin( 37 T£) sin( 47 n/) 


: sin( 37 i 


57rAt 


) sin( 47 ry) 


5 v 2 

4 5ttA£. ... . 

—7 cos (3 th; --—J sm( 47 ry) 

5 2 

— sin( 57 rt) sin( 47 n/) 
sin( 37 r — birt) sin( 47 rj/) 


0 

0 


The solution is then: 


E z (x , y , t) = sin(37r£ — 57 rt) sin( 47 ry). 

For the three schemes we choose uniform gridspacing. For the Yee, the explicit(2,4) and 
the Ty(2,4) schemes we take: h = Ax — Ay = 1/20,1/40,1/80. For the Yee scheme we 
take At = 2h/3 and for the explicit(2,4) scheme we take At = h 2 . 

In table 1 we present the errors in L 2 norm for the Yee and the explicit(2,4) scheme. 
In both cases the errors are almost linear in time. However as the mesh is refined the 
Yee scheme yields second order accuracy while the explicit(2,4) yields between fourth 
and fifth order accuracy. 
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Table 1: The maximal errors in L 2 norm. 

We next consider the treatment of a domain which contains air and a lossless dielectric 
with a relative permittivity of e 2 as shown in Fig(l). 



Figure 1: The computational domain 


E z is continuous but its second derivatives are discontinuous. To overcome this 
problem we use an innovative approach, which is presented in [6] and [7]. This approach 
helps to preserve the fourth order accurcy of the explicit scheme. In this approach 
we divide the computational domain into three subdomains. Two contain air and the 
third one contains the lossless dielectric. On the interfaces the treatment differs. On the 
interfaces both the electric and magnetic fields are approximated. Suppose the interfaces 
are located at i = Ii and i = J 2 and e = e 2 for I x < i < I 2 and e = 6i at i > h and 
i < I\. We approximate H y at i = Ii and i = I 2 by using the following fifth order 
extrapolation: 

rrn+l/2 _ 315 Tjn+1/2 _ 105 fjn+ 1/2 . 189 rrn+l/2 _ 45 rrn+l/2 i 35 rrn+l/2 

"y/iJ 128"y/!-i/ 2 ,i 32 2 / 2 ,j -r 64 "y/j- 5 / 2 ,; Z2 I *yi l -7/2,j ^ 128^!-9/2,; 

rrn+l /2 _ 315 -rrn+l/2 _ 105 rrn+l/2 i 189 rrn+l/2 _ rrn+l/2 i .35. rrn+l /2 
a Vi 2 j 128 iJ y/ 2+ i/ 2l ; 32 Ji y/ 2 + 3/2,; ^ 64 Ji yj 2 + 5 / 2 ,; 32^y/ 2 +7/2,; ^ 128 ^/ 3 + 9 / 2 ,; 











Hy+ 1 / 2 can be extarpolated by using the points to the left of Ii or to the right of I \. This 
is true because H y is a continuous function. Once H y is approximated on the interface 
we approximte the x derivative of H y using H yii j and H y , 2 j This is done in order to 
approx imate the time derivative of E z . This is done the following way: 


—H n+ V 2 
dx yi "> 

— rjn+1/2 

dx yi ™ 


352 tj _ 35 tt 

105 "Ww 8 12 Vh +1/2 


352 tj » 35 tt 
"10* r 8 a y J 2 -i/2 


I 35 TJ 

T 24 ^1+3/Jj 

— 

24 I2 yi2-3l2,j 


_ 21 TT I 5 TJ 

4Q- n yii+S/2,j ' 46 X2 y/ 1 +7/2,j 

I 21 U _5_ TT 

" r 40'°J'/2-5/2 j 46' n 3'/2- 7 /2,i 


Once and ^H y ^ 2 are calculated we can evaluate EZ 1 }^ and EZ 7 }^ the 

following way: 




“ 24Ay {^ x h,j-Z /2 + 27H XJiJ+ 1 / 2 #x /lii+a/a ) 

I dt f 352 rjn+1/2 35 rrn+1/2 » 35 trn+1/2 _ 21 rrn+1/2 i j5_ irn+1/2 \ 

T Ai V 105 ^ 2 //! 8 "y/ 1 + i/ 2> j “ 24 ’ /J 2 // 1 + 3 / 2 j 40 JI/J+ 5 / 2 ,; " r 46 - n y/ 1 + 7 / 2 J 7 




= - afe - 2r^ /2 ,_ l/s + 27H Xh j+l/i - H XhJ+m ) 

I eft tfn+1/2 (. 35 rrn+1/2 _ 35 rrn+1/2 « 21 rrn+1/2 _ 5. rrn+1/2 \ 

V 105 ^VlzJ 8 -"3/72+1/20 24 JJ y/ 2 +3/2o ~ 40"y7 2 +5/2J 46"y/ 2 +7/2oV 


Since -^H y ^/ 2 and -feHy+V 2 are approximated where e = 62 we set e = 62 at 2 = /i 
and i = / 2 . ^The extension of this innovation to two and three dimensions can be found 
in [ 6 ]. 

The slope of the Yee scheme is 2 and the slope of the Ty(2,4) scheme and the ex¬ 
plicits^) converges to 4 as can be seen in table 2 


scheme 

ft 

At 

Mgx(|| error | |i 2 ) 

0 < t < 10 

rate 

explicit( 2 ,4) 
explicit( 2 ,4) 
explicit ( 2 ,4) 
Ty(2 5 4) 
2V(2,4) 

am 4 ) 

Yee 

Yee 

Yee 

1 

\° 

f 

f 

f 

40 

80 

20 

40 

80- 

I ’ 
450 

1600 

3200 

4^0 

1600 

6400 

¥ 

—120. 

6.9239 x 10 -a 
3.5486 x 10 “ 6 
2.0112 x 10“ 7 
2.7043 x 10 - 5 
1.4233 x 10 “ 6 
1.1040 x 10 - 7 
0.0095 
0.00237 
5.9442 x 10 - 4 

4.286 

4.141 

4.249 

3.688 

| 2.003 
i 1.9953 
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Table 2: The maximal errors in L 2 norm with e 2 = 4. 

Next we consider a monochromatic isotropic point source of wavelength 0.25, that is 
switched on at t = 0 in the presence of an infinit perfect conductor (fig(2)). 



Figure 2: The computational domain. 

The domain is — oo < x,y < —oo < y < oo. For Yee’s scheme we choose h=~, 

At = y, for the explicit(2,4) and theTy(2,4) schemes h = At = ti 2 . The point 
source is modeled by adding a term representing a current I z (t) = 0.01 sin(87rt)e(t) at 
(a:, i/) = (i, ~) where e(t) denotes the Heaviside unit-step function. The equations in 
this case are: 

, dH y BE, yj fj\e/ \ 

nr z ^-~di ] - zhmr - rs) 

dH x _ 1 dE z 

dt Z dy 

dHy = 1 dE z 
dt Z dx 

With the boundary conditions: 

E z (l/2,y,t) = 0 (5) 


In figures 3 we draw the errors in L 2 norm. 
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Next we test these three schemes in [0,1/2] x [0,1/4] x [01/2] domain. An exact 
solution in this case can be [8]: 

H x = sin(c<;i) sin (Ax + By 4- Cz) 

H y = sin(o;t) sin(Ax 4- By 4- Cz) 

H z = sm(ujt) sin (Ax + By + Cz) 

_ jg 

E x = -cos(o it) cos(Ax + By + Cz) 

cj 

„ A — C 

E y = -cos(o;t) cos (Ax 4- By 4- Cz) 

CJ 

B — A 

E z = -cos(atf) cos (Ax + By + Cz) 

cj 

Where 

J 1 = a 2 + b 2 +c 2 

0 = A + B + C 

We choose: 

A = 7r, B = —27r, C = 7T, cj = \/67r. 

In table 3, we can see that for the explicit(2,4) and the Ty (2,4) schemes we have used 
At = h 2 and for the Yee scheme we have used At = The explicit(2,4) as well as the 
Ty(2,4) schemes behave better than expected and gives almost fifth order of accuracy. 
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scheme 


rate 


scheme 

h 

At 

Mai(jlerror||z, 2 ) 

0 < t < 10 

rate 

explicit( 2,4) 
explicit ( 2,4) 
explicit(2, 4) 
Ty( 2,4) 
Ty( 2,4) 

Ty( 2,4) 
Yee 

Yee 

Yee 

1 

7 

7 

7 

\° 

f 

f 

20 

1 

f 

. so 

1 

4J0 

1600 

3200 

4J0 

1600 

6400 

f 

7 

nn 

5.375 x 10- 4 
2.184 x 1CT 5 
9.071 x 10- 7 
3.621 x 10 -4 
1.144 x 10" 5 
3.5621 x 10- 7 
0.0027 

7.3 x 10~ 4 
1.8252 x 10“ 4 

4.621 

4.590 

4.983 

5.005 

j 1.9028 
2.0015 


Table 3: Comparison of the errors in L 2 norm 


5 Computational Cost Comparisons 

In order to compare the efficiency of the explicit(2,4), the Ty(2,4) and Yee scheme we 
take the following boundary conditions: 

E z = sin(37rx) sin(47n/) 

H y = (3/5) cos(37rx) sin(47ri/) sin(—-—) 

H x = — (4/5) sin(37ro;) cos(47ry) sin(^^) 

The exact solution in this case is: 

E z = sin(37rx) sin (47ry) cos (57ft) 

For the Ty(2,4) scheme and the explicit(2,4) scheme we use a uniform gridspacing with 
Ax = Ay — A. For th e Yee scheme we also use uniform gridspacing with Ax — Ay = 
Aq. We chose these mesh sizes in order to get the same error between the exact E z and 
the approximated E z in L2 norm. The comparison is shown in table 4. The programs 
were written in fortran and run on a Digital Alpha workstation. 


scheme 

h 

At 

| 

Max(\\error\\i, 2 ) 

0 < t < 10 

CPU — time 















Table 4: CPU-time using various difference schemes. 

The CPU time needed to achieve the same accuracy in Yee’s case is more than 15 
times larger than required for the Ty(2,4) scheme and 91 times larger than required for 
the explicit(2,4) scheme. 


6 Discussion and Conclusion 

The results demonstrate that we can use a coarser mesh with the fourth order scheme 
and still get the same accuracy as with the Yee scheme. This is true even in the presence 
of a dielectric media. 

Although this scheme is not as good as the Ty(2,4) scheme[2], it is still easier to 
modify an existing code based on the Yee scheme and make it fourth order accurate, by 
using the explicit(2,4) scheme. This is true because in the Ty(2,4) scheme one has to 
inverse a matrix by using a LU decomposition. 
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A stable second order Cartesian grid finite difference scheme for the 
solution of Maxwells equations is presented. The scheme employs a stag¬ 
gered grid in space and represents the physical location of the material 
and metallic boundaries correctly, hence eliminating problems caused by 
staircasing, and, contrary to the popular Yee scheme, enforces the correct 
jump-conditions on the field components across material interfaces. 

To validate the analysis several test cases are presented, showing an 
improvement of typically 1-2 orders of accuracy at little or none additional 
computational cost over the Yee scheme, which in most cases exhibits first 
order accuracy. 


Key Words: Finite difference schemes, Maxwells equations, Complex geometries, Embedded interfaces, Stability, 
Staircasing 


1. INTRODUCTION 

The development of efficient, yet simple, computational methods for the accurate time-domain solution of 
Maxwells equations remains a very significant challenge. In recent years attempts were made to apply high- 
order finite difference scheme, and multi-domain spectral methods to solve the problem. These methods, 
however, share the disadvantage that they either are restricted to simple geometries or that a multi-domain 
framework must be introduced to facilitate the correct treatment of general metallic boundaries and material 
interfaces. 

The very popular Yee scheme, introduced in [3] uses 2nd order finite-difference scheme on a staggered grid. 
This straightforward approach remains the by far most popular time-domain computational techniques for 
the modeling and design of problems in computational electromagnetics. An updated review on this method 
can be found in [1, 2]. 

The implementation of the Yee scheme in complex geometries presents the difficulties of approximation the 
boundary and the boundary-conditions. The straightforward using of a staircased curve affects the overall 
accuracy and essentially reduces the scheme to first order. In the case of metallic boundaries this problem 
has received considerable interest in the past, see e.g. [5] and references therein. Most proposed methods, 
however, sacrifice the simplicity of the original Yee scheme to achieve the improved accuracy. 


1 Corresponding Author 
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A more subtle problem that has received significantly less attention is the effect of using staircasing around 
a non-metallic embedded interface, i.e., a transparent interface. While the conditions connecting the field 
components on both sides of such media are well known, no effort is made in the Yee scheme to enforce 
these conditions. To overcome the problems associated with such an approach the standard technique is to 
extend the size of the interface by introducing a transition zone in which averaged material properties are 
employed. While such techniques appears to improve the overall accuracy, it has nevertheless been shown 
that the global accuracy of the scheme is reduced to first order [4]. This is a one-dimensional result and, as 
we shall show here, the situation is much worse for problems beyond one dimension. 

To specifically address the problems associated with embedded metallic and general material interfaces 
when solving Maxwells equations, we present in this work a novel finite-difference scheme that maintains 
global second convergence in the presence of arbitrary interfaces, curved as well as straight. It is worth noting 
that staircasing problems at metallic boundaries are resolved at no additional computational cost and internal 
material interfaces are treated in an equally efficient way. Indeed, the main additional computational cost of 
this new scheme lies in a preprocessing stage and adds only little to the cost of solving the time-dependent 
problem. 


2. THE ONE-DIMENSIONAL SCHEME 

To illustrate the central elements of the proposed scheme, let us consider the solution of the one-dimensional 
Maxwell equation 


dE 

dH_ 

6 at ~ 

dz 

dH 

dE 


dz 


(i) 


where E(z,t) and H(z,t) signifies the normalized, mutually perpendicular tangential electric and magnetic 
field components, while e and fx refers to the relative permitivity and permeability, respectively, of the 
materials. 

In case of a perfectly conducting wall (PEC) the boundary conditions becomes 


EiZpect) = 0 or 



( 2 ) 


where z pec signifies the position of the wall. In case the interface has finite conductivity the field components, 
termed (E^\H^) and (E^ 2 \ H^) are continuous across the material interface, situated at z mat , as 


To numerically solve Eq.(l), we introduce a spatially staggered grid with grid size h as 

Zj ~ hj , z J+ i = h(j + |) , (4) 

and embed the full problem into this simple Cartesian grid. To account for the possibility of the physical 
problem not being aligned with the grid, we introduce 7 € [ 0 , |] which measures the relative distance between 
the physical boundary and the first grid point, see Fig. 1. 

The two field components, E and H, are collocated as 
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LJ<2) u<2) 

1 N- 3/2 11 N- 1/2 


E (2 ) 
*- N- 1 


T?h 


FIG. 1. Definition of grid, numbering and various parameters for solving the one-dimensional Maxwells equations in a 
PEC cavity filled with two materials. 


E( z j,t) — Ej , H(zj + ^,t) — i?j + i 


as illustrated in Fig. 1. The actual limits on j depends on the individual computational problem and it 
should be clear that the first and last grid points in the computational domain can be an E~ as well as an 
Ff-nodes. We recall that e as well as [i may depend on z, possibly in a discontinuous manner, and that the 
location of such material interfaces as well as the enclosing walls need not be aligned with the grid. 

Let us consider the situation of a perfectly conducting cavity, enclosed by two PEC walls situated at 
Zpec and Zpec- The cavity is assumed to be filled with two regions of different materials, having properties 
and and with the interface situated at z mat as illustrated in Fig. 1. We introduce 

the two sets of fields, with k = 1 , 2 , representing the solution in the two regions of different 

materials and it assume that the two solutions are given on two separate grids and connected only through 
the conditions across the material interface, Eq.(3), much like in a multi-domain solution technique. For 
simplicity we assume here that N , being the number of grid points in each domain, is the same in both 
regions. This, however, is done purely to simplify the notation and the generalization for the general case is 
straightforward. 

To account for the situation where the interfaces, be they material interfaces or PEC walls, do not coincide 
with a grid point we associate with each region two parameters, 7 ^ and 7 ^, which measure the distance 
from the first/last grid-point to the physical position of the wall or interface relative to the grid size h. 
Clearly, 7 W € [0, 5 ] and, due to the global equidistant grid, 7 ^ + 7 across a material interface. 
Since these parameters are only geometry dependent, they can be computed and stored in a preprocessing 
stage once the grid has been defined. 

In the interior of each of the two regions with constant material parameters, we shall use the standard 
staggered scheme as 








dt 


(5) 


and 


i e [0, TV _ 2] 




dH\ 


(*) 




jr(*) 

~i+L 



( 6 ) 


where k = 1,2 and the numbering follows that in Fig. 1. Without loss of generality we assume that the 
material properties are constant within each region of the cavity. This, however, extends trivially to include 
problems where the materials vary smoothly within each domain. 
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To complete the scheme we shall introduce special schemes for updating E^ and , in such a way 
that the boundary conditions at the PEC walls, the material interface and the physical position of these 
relative to the grid is accounted for correctly. 

For we use the following second order approximation 

dE (i) ^(i) 

(1) O £, 0 y L _J_ 

dt l + h • (7) 

The scheme for updating H^_ k is simpler as we can explicitly exploit that E^ 2 \z pec ) = E pec = 0 to 


dff™ i 

,,( 2 )— 


E P ec “ E\j 


2#+i 


Let us now consider the treatment of the material interfaces across which we know that the individual field 
components must remain continuous. However, one should not, as is implicitly done in the traditional Yee 
scheme, assume that the fields are smooth across the boundary and simply use and to update 

jj-C 1 ) 

n N- 1 / 2 * 

To solve this problem, we introduce the extrapolated value 


J Bmat = (l + 7i J W ) -Ti 2) ^P ) , 


and complete the scheme for updating as 


_ 2 E m at ~~ 

^ dt ~ 2 t ^ + 1 h 

Likewise we compute the flux of E^ by using the extrapolated magnetic field 


= (1 + 7& ) )ff£l 1/2 - ifi’agi. 


.m dEp 7 ' _ 


The rigorous proof of stability and accuracy of this scheme is given in [5]. It should be noted that 7 ^ 
always appear as + 1 (c = 1 at metals, otherwise c = 2) when ever being in the denominator. As a 
consequence the scheme can be expected to be well behaved for all values of Jr and 7 £. 

The application of the scheme in multi dimension domain is more complex due to the boundary conditions, 
which involve a combination of all the components of the vector-fields E and H. In order to deal with this 


920 




FIG. 2. The discrete L 2 -error at t — 2tt for the electric (a) and the magnetic (b) field components in an air-filled metallic 
cavity as a function of the resolution h = 2 (N. The dashed curves are obtained without properly accounting for the correct 
position of the metallic walls while the full curves are obtained with the new scheme. 


problem extrapolation of the fields should be made to and along the boundaries. A comprehensive discussion 
on the boundary condition representation in two and three dimensions is given in [5]. 


3. A FEW NUMERICAL TESTS 

Let us consider a simple test case consisting of a one-dimensional electromagnetic resonator with perfectly 
conducting walls located at ZpU = — 1 and Zpei = 1- The interior of the resonator can be either air-filled or 
filled with two di-electric media with the material interface at z mat = 0 and /i^ 1 ) = /A 2 ) = 1 . 0 . 

For = n^ 2 ) the solution to Eq.(l) is a standing sinusoidal waves. In the case ^ the solution 
is sinusoidal in each of the domains and is continuous but non smooth at the material interface. 

We shall seek the numerical solution to the cavity problem on the staggered grid, Eq.(4). Note that for 
7 = 0 the grid coincides with the boundaries as well as the material interface at r mat = 0 while for 7 > 0 , 
the whole grid is simply shifted towards the right, creating a situation exactly as sketched in Fig. 1 with 
7 l° = 7 and 7 * 5 = \ - 7- 

Let us begin by considering just the vacuum filled cavity, i.e., nW = n( 2 ) as 1.0. In Fig. 2 we plot the 
global error at t = 2k as a function of N = 2/h for various values of 7 . As discussed in [5, Sec 2.2], the 
classic staggered scheme and the new scheme are equivalent for the perfectly conducting homogeneous cavity 
with 7 = 0 and we observe, as expected, a global 0(h 2 ) convergence in Fig. 2 for both the E and the H 
component. However, for 7 > 0 the schemes yield very different results with the classic scheme being reduced 
to G(h ) convergence as a direct result of the geometry of the actual problem being approximated to first 
order only. The new scheme, however, maintains second order global convergence for both field components, 
yielding a dramatic improvement in accuracy at no additional computational cost, compared to the classical 
scheme. 

The situation for a metallic cavity filled with two different materials, having = 1.0 and nW = 1 . 5 r is 
even more alarming when one considers the performance of the classic Yee scheme. In Fig. 3 we show the 
decay of the global error with increasing resolution when using the staircased approximation as well as the 
new scheme where no staircasing is introduced. We note in particular that even when the grid is aligned 
with the geometry is the staircased approximation reduced to a first order scheme at best. 

For the new scheme the global second order convergence is recovered for all values of 7 , confirming the 
analysis of [5, Sec 2.2]. 

To illustrate the particular importance of correctly imposing the jump-conditions when solving multidi¬ 
mensional problems, we consider the TE problem in a square cavity filled with two di-electric materials. The 
boundary between those materials is a straight line tilted with respect to the z-axis by an angle 9. 
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FIG. 3. The discrete X, 2 -error at i ss 2 -tt for the electric (a) and the magnetic (b) field components in a metallic cavity 
as a function of the resolution h s= 2/N. The cavity is filled with two materials with the interface being positioned at z = 0. 
The dashed curves are obtained without properly accounting for the position of the interfaces and the correct jump conditions 
while the full curves is obtained using the new scheme. 




FIG. 4. In a) we illustrate the initial conditions for the modified one-dimensional cavity problem in the case where the 
material interface is assumed tilted at 9 ss 30°. In b) we show the global L 2 -error of E for different angles, i.e., the degree of 
discontinuity of E at the interface, as computed using the staggered Yee scheme and the new non-staggered approximation. 


We consider a situation in which a plane wave impinges on the interface from the left propagating along 
the z- axis. 


E*' W = 


s(2) 


e(2 )«2 + ill 






1 + — 1) cos 2 6 


-E T 'W 


(13) 


We observe that for 6 — 0, E s is continuous as expected while for 0 = tt/ 2 it experiences a maximum jump as 
it becomes a purely normal component. The degree of discontinuity of E x across the interface in controlled 
solely by £ and 0. 

We shall now attempt to model this problem using the Yee scheme and the newly developed scheme. To 
make things even simpler we shall assume that 7 = 0, i.e., the material interface is located at a grid point 
although the interface generally is tilted at some angle. 

Due to the nature of the Yee scheme, it is naturally unable to correctly model the discontinuous field 
component. As we see in Fig. 4, a consequence of this is that the Yee scheme is globally non-convergent for 
a problem with a discontinuous solution. One should recall that while one observes global non-convergent 
behavior, the solution is locally divergent. 

Using the new method, however, the correct solution is recovered to global 2nd order accuracy. That is 
orders of magnitude better than obtained with the Yee scheme at the same work. 
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The relevance of this study is two-fold. On one hand it demonstrates the ability of the new scheme 
to accurately and efficiently model problems with discontinuous solutions. Secondly, and perhaps most 
importantly, it illustrates the inability of the Yee scheme to handle such problems. For more examples an a 
detailed analysis see [5, Sec 2.2]. 


4. FINAL REMARKS 

The purpose of this paper has been twofold. On one hand, we wanted to show that the need in the 
Yee scheme to introduce a staircased approximation of curved metallic boundaries dramatically reduces the 
overall accuracy of the scheme which at best is first order. However, the situation at transparent interfaces 
is much more troubling. As the Yee scheme is applied everywhere in the computational domain no effort is 
made to impose the proper jump conditions on the electric and magnetic fields. Since these are continuous 
at best, this clearly introduces another source of significant error which has received limited attention in the 
literature. Indeed, as we have argued and confirmed through a very simple computational example, this may 
result in a non-convergent approximation. No averaging of material properties or a variety of other simple 
techniques can restore this property. 

It has been the purpose of the second topic of this paper to present a novel second order scheme, building 
on the superior behavior of the Yee scheme in homogeneous regions and utilizing the same spatially staggered 
grid, yet modified in such a way that the need for staircasing is eliminated and physical jump-conditions 
are imposed to the order of the scheme. The new scheme is globally second order accurate for arbitrarily 
embedded interfaces. Moreover, there is no extra running-time computational cost with respect to the Yee 
scheme. 

The aim of the current research is to extend the ideas introduced in this paper to high order schemes. 
The first steps in a development of suitable fourth order schemes have been taken in [6] where a fourth order 
scheme has been studied computationally, yielding very promising results. Encouraged by these results as 
well as our own initial studies, we hope to be able to present a stable fourth order Cartesian grid method 
for Maxwells equations in the near future. 
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Abstract 

Small printed antennas are becoming one of the most popular designs in personal wireless 
communication systems. Recently, a new type of printed meander line antenna was introduced. 
In this paper, the characteristics of a novel tapered meander line antenna are presented and 
analyzed using the finite difference time domain technique. Experimental verification is applied 
to ensure the effectiveness of the numerical model, and very good agreement is found between 
numerical analysis and measurements. A new design of this antenna features an operation 
frequency of 2.55 GHz with a 230 MHz bandwidth, which supports future generations of mobile 
communication systems. 


Introduction 

With the advancements of modem integrated circuit technologies, personal communication 
systems (PCS) features light weight, small size, high frequency operation, and high transmission 
efficiency. Mobile antenna design is one of the major tasks in PCS designs, which requires easy 
integration with the interior circuitry. One of the most widely used wireless communications 
systems is the global system for mobile (GSM) communications, which operates at 890-915 
MHz for uplink and 935-960 MHz for downlink. The new generation of personal communication 
systems, such as digital communication systems (DCS) 1800, operates at 1.710-1.785 GHz for 
uplink and 1.805-1.880 GHz for downlink. Another widely adopted telecommunication system 
for PCS is the code division multiple access (CDMA) system, which operates at 1.8 to 2.0 GHz. 
In addition, recently, indoor cordless phones and modems for wireless local area networks 
(WLAN) operating at 2.4 GHz are also available in the market. Therefore, antennas for current 
and future generation of personal communication systems are designed to operate at the 
frequency range from 0.9 GHz to 2.5 GHz. If the future use of higher frequency PCS and the 
possibility of applying PCS for military purposes are considered, then antenna designs for 
wireless personal communication systems should be expanded in scope to cover the frequency 
range from the current 0.9-2.5 GHz to 0.9-3.0 GHz range. New types of printed meander line 
antennas have been recently investigated by several authors [1-6]. 

In this paper, studies for the characteristics of a novel tapered meander line antenna, using the 
finite difference time domain (FDTD) [6] techniques with Berenger’s perfectly matched layers 
(PML) [7] absorbing boundaries are presented. The presented designs of the antenna shown in 
Fig 1, feature small dimensions (32 x 21 x 3.17 mm 3 ), and approximately 50 Q input impedance 
with a dual printed sleeve tuner, and it operates within the 0.9-3.0 GHz range on a comparably 
small ground plane (60 x 27 mm 2 ). The validation of the numerical analysis used in this 
investigation is made by computing the return loss of the tapered antenna and comparing it with 
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measurements. Very good agreement is found using the developed FDTD code. Analysis and 
design based on this numerical model are thus performed. A wideband design is achieved by 
using a dual sleeve tuner [2, 6], whose operating bandwidth is centered at 2.5 GHz and with 230 
MHz of band-width. This optimal design can be used for future generation of wireless phone or 
current other wireless applications operating around this frequency. 


Analysis 

The antenna considered in this investigation, as shown in Figure 1, is a tapered meander line 
trace printed on a 21-mm dielectric slab sitting on a 60 x 27 mm 2 perfectly conducting ground 
plane. The parameters el and e2 represent the lengths of the vertical and horizontal printed 
traces, respectively. The vertical segment length el ~ 2 mm, and the horizontal segment length 
e2 starting from 4 mm ending at 17 mm with a 1 mm increment for each horizontal segment, 
which results in the vertical length of the antenna Lax - 29 mm. The distance between the edge of 
the dielectric slab and the edge of the ending segment of the meander line is set equal to el. The 
reason for modeling the meander line antenna on a small ground plane (60 x 27 mm 2 ) is to 
simulate its performance when this antenna is placed on top of a PCS handset or a small effective 
ground plane of wireless appliances. The main objective of this research is to design a meander 
line antenna with 50 LI input impedance at the operating frequencies within 0.9 to 3.0 GHz. The 
frequency range under study is extended from 0.9-3.0 GHz to 0.001-10.0 GHz for clearer 
observations of the resonance behavior of the antenna. 

The finite difference time domain (FDTD) technique is applied to model the antenna inside a 
3-dimensional air chamber terminated with the artificial boundaries of Berenger’s perfectly 
matched layers (PML) [7]. The PML is adopted to reduce the numerical reflection from the 
truncated boundaries of the finite problem space, which resembles the anechoic chamber for 
antenna measurements. An array of voltage sources with a Gaussian waveform is placed between 
the ground plane and the edge of the first vertical segment of the trace line for excitations. 
After the characteristics of the tapered meander monopole are analyzed, dual printed sleeves are 
employed in the design to enlarge the bandwidth and tune the input impedance toward 50 Q. 
Another tuning method using dual floating printed sleeves is studied, which can be used to model 
the behavior of the dual sleeves that are not well grounded. 


Results 

In order to verify the numerical analysis, a tapered meander line antenna is built and 
measured. To prevent bending of the dielectric substrate, the width is increased 2 mm more on 
each side than the width specified in the previous section. The thickness of the PCB is one half 
of that is used in the simulations. The performance of this antenna is measured on a 1.5 x 2-m 2 
large ground plane to avoid the influence from surrounding objects. This ground plane is 
effective to simulate a semi-infinite ground plane for GHz application. The test is done using HP 
8510C network analyzer, by measuring the return loss sweeping frequency up to 10 GHz. As 
shown in Figure lb, there is a 43-mm coaxial cable connected to the fixture of the antenna that 
serves as an antenna feed. This coaxial cable is excluded in measurement results by applying port 
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extension to the electrical length of the cable, for establishing a reference plane at the antenna 
feed, as determined by the time domain reflectometry (TDR). 

After calibration, the analyzer is switched to time domain mode, and the antenna 
characterizations are measured in the time domain. To locate the starting plane of the antenna 
trace, the antenna is shorted to the ground at the feed point, which results in total reflection with 
180 degrees out of phase. Therefore, by locating the reflection with magnitude of 1 and by 
measuring of the time difference between this reflection spike and analyzer’s reference plane, the 
information to translate the network analyzer reference to the apparent antenna reference plane is 
obtained. Therefore, the actual electrical delay can be determined by using one half of the time 
difference between the analyzer reference plane and the total reflection spike, which is 212.5 
picosecond for this antenna model. The one half factor is required because the analyzer presents 
the time domain reflectometry (TDR) data for twice the traveling time between source and load 
to detect this total reflection. The computed impedance for the taper meander line antenna placed 
on a semi-infinite ground plane can be obtained by computing half of the meander line dipole 
configuration impedance, which is analyzed using a cell size of 0.5 mm with 2 cells used for 
modeling the source. Very good agreement is observed up to 10 GHz between the numerical and 
experimental results. The slight frequency shift in magnitude and phase differences at higher 
frequency range are mainly accounted by the different feeds used in the numerical and 
experimental methods and the conductor and dielectric losses at higher frequencies which are not 
considered in the FDTD simulation. From comparisons of results in Figure 2, the effectiveness of 
the numerical model and methodology are reassured. 

The return loss of the tapered meander line antenna on a comparably small ground plane is 
shown in Figure 3. From Figure 3, the first and the third resonance of this taper line is at 1.2 GHz 
and 2.7 GHz, respectively, which may be tuned as a 50-Q input impedance for current wireless 
application. Instead of using a 0.5-mm cell size, the computed return loss for the small ground 
plane case uses 1-mm cell size, which is small enough for effective simulation. To simplify the 
tuning design, the dual-sleeve method [2, 6] is adopted to improve the input impedance. Because 
the taper line is not a uniform segment ratio trace, the optimal sleeve length / is not necessarily X A 
of Lax, as suggested in [2]. Choosing a spacing of 1mm away from the longest horizontal 
segment, the effects of different sleeve length is observed in Figure 3(a). From Figure 3(a), the 
optimal sleeve length for lower frequency end is found when sleeve is 24 mm, 83 % of Lax- 

Since the optimal sleeve length is determined for the current antenna configuration, the 
following analysis is made to determine the best location for dual-sleeves. From Figure 3(b), the 
optimal return loss is found when the spacing between the longest segment and a sleeve is at the 
edge of the substrate, 3-mm. The case is operates at 2.55 GHz with 230 MHz bandwidth, which 
is effective for future wideband mobile phones and current wireless applications in the vicinity of 
this frequency. 

To analyze the case with the sleeve tuners are not connected to the ground; another study is 
conducted for dual floating printed lines. The lines are simulated by printed traces at the center 
of the tapered meander line trace. From Figure 4, the effects of floating traces are found to be 
trivial at frequencies below 5 GHz, but detectable above 5 GHz. From this study, the results 
indicate that the tuning effectiveness is greatly reduced when the dual sleeves are not well 
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grounded. 


Conclusions 

The reliability of the numerical model in this study is demonstrated by comparing FDTD 
simulations with prototype test results. A detailed investigation for optimizing the operating 
frequncy and input impedance of a taper meander monopole for wireless communication has 
been presented. The presented tapered meander line monopoles can be tuned for a broad 
bandwidth of 230 MHz, operation at 2.55 GHz, and for a 50 Q input impedance using printed 
tuning sleeves, which is appropriate for support of current and future generations of wideband 
wireless communication systems. Optimal results for lower frequency applications, between 1 
and 2 GHz, may be achieved by increasing the meander line trace segments or by employing 
other tuning methods. Future studies will be focused on finding simple tuning methods to tune 
the lower frequncy modes and the optimization of the radiation pattern parameters of the current 
design. 


REFERENCES 

[1] K. Noguchi, M. Mizusawa, T. Yamaguchi, and Y. Okumura, “Numerical analysis of the 
radiation characteristics of the meander line antennas consisting of two strips,” IEEE AP-S 
Digest, pp. 1598-1601, 1996. 

[2] M. Ali, S. S. Stuchly, and K. Caputa, “A wide-band dual meander-sleeve antenna,” Journal 
of Electromagnetic Waves and Applications, vol. 10, no. 9, pp. 1223-1236, 1996. 

[3] A. Z. Elsherbeni, J. Chen, and C. E. Smith, and Y. Rahmatt-Sami, “FDTD analysis of 
meander line antennas for personal communication applications,” Progress in 
Electromagnetics Research Symposium (PIERS), Cambridge, MA, 1997. 

[4] M. Ali, S. S. Stuchly, and M. Okoniewski, “Characterization of planar printed meander line 
antennas using the finite difference time domain technique,” IEEE APS Dig., pp. 1546-1549, 
1997. 

[5] H. Y. Wang and M. J. Lancaster, “Aperture-coupled thin-film superconducting meander line 
antennas,” IEEE Trans. Antennas Propagat., vol. 47, no.5, pp. 829-836, May 1999. 

[6] C. P. Huang, A, Z. Elsherbeni, J. J. Chen, and C. E. Smith, “FDTD characterization of 
meander line antennas for RF and wireless communications,” Progress in Electromagnetics 
Research Symposium (PIERS), vol. 24, pp. 185-200,1999. 

[7] J. P. Berenger, “A perfectly matched layer for the absorption of electromagnetic waves,” J. 
Computat. Phys., Oct. 1994. 


927 





Meander line 



Figure 1(a) Tapered meander monopole with dual 
sleeves on a small ground plane 



Figure 1(b) The prototype of a tapered meander line 
monop 



Frequency (GHz) 

Figure 2(a) Comparison of the computed and 
measured return loss of the prototyped 
tapered meander line monopole. 



Frequency (GHz) 


2(b) Comparison of the computed and 
measured phase of the reflection 
coefficient of the prototyped tapered 
meander line monopole. 
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Meander line 



Figure 4(a) Tapered meander monopole with dual 
floating lines. 


3(a) Return loss of the tapered meander 
monopole versus sleeve length. 



Frequency (GHz) Figure 4(b) Return loss of the tapered meander 

monopole versus floating line length. 


Figure 3(b) Return loss of the tapered meander 
monopole versus sleeve spacing. 
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Abstract - Theoretical models of modern plasma tools for surface processing need accurate 
electromagnetic computation for field prediction. A numerical code based on the FDTD method for 
electromagnetic Held computation at microwave frequencies inside a region non-uniformly filled by a 
magnetized collisional plasma has been validated by computation of the resonant frequencies for a 
circular cylindrical uniformly-filled cavity. A realistic model of an ECR camera, sourced by a Lisitano 
coil antenna, has been proposed. 


I. Introduction 

Surface processing by plasma is an established technical and industrial procedure since long 
time. The most common method is to excite the plasma with a 13.56 MHz radiofrequency source. 
In the last years a plasma reactor based on microwave absorption at the electron cyclotron 
resonance was developed. In this Electron Cyclotron Resonance (ECR) reactor, plasma is formed 
inside a magnetic field generated by a solenoid. The advantage of the ECR reactor is that the 
formation zone of the plasma can be away from the target where the surface to process is placed. 
Plasma streams down from the formation zone along the field lines to surface. This should allow 
better control of the process and of the geometry. Several versions of the ECR reactors are now on 
the market. The microwave frequency is generally 2.45 GHz, consequently the magnetic field 
intensity is about 875 Gauss and is slowly variable in space in order to control the absorption zone 
[ 1 , 2 ]. 

One of the research goals in the field of ECR is to improve the method in order to be able to 
process wafers having larger and/or non planar surfaces. Possible theoretical models are based on 
the analysis of electromagnetic wave propagation in a collisional plasma confined by a magnetic 
field in a rectangular or cylindrical cavity. The magnetic field is spatially varying so that the wave 
propagates along the field lines in regions where the wave frequency is slightly below the 
cyclotron frequency (whistler wave). In the process of propagation the wave approaches the 
resonance zone and it is absorbed. 

Cold plasma can be generated inside the cavity (Fig.l) by a Lisitano slotted coil antenna £31 
which excites an electric field pattern similar to that of the TEon mode in a cylindrical cavity. As 
additional advantage, the coil diameter can be determined independently of the microwave 
frequency. The plasma is excited within the coil not only near the slots but also in the central area. 
To obtain a unif orm field inside the coil, the radius r L compared with wavelength should be such 
that: X / 60 < < 3X . Moreover the Lisitano coil also works as a plasma ejector. 

The plasma dielectric tensor is a function of space and frequency as well as of plasma 
parameters and further computational complexity is originated for a realistic model by boundary 
conditions on non-simple surfaces. Therefore the electromagnetic wave propagation problem can 
only be solved by numerical tools. We use an extension of the FDTD method to the analysis of 
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gyrotropic media [4], 

In this paper the results of a numerical simulation for an ECR camera with a realistic model 
for the launcher are reported. Previously, the algorithm has been validated for a cylindrical cavity 
which was homogeneously filled by a collisional axially-magnetized plasma. For this structure 
resonant frequencies can be obtained in closed form, therefore a comparison with FDTD computed 
data can be easily performed. 



m ■ ■ 



Fig.l: ECR camera typical set-up with axial symmetry along z (left). A simplified scheme of a Lisitano coil 
antenna (right) showing the slot line 


n. Method 

The relationship between electric flux and electric field for a magnetized cold plasma is 
; D(©)/e 0 =e xx (©)£(©) +(©)-(©)]z■ £(©) + ye^ (©)£x£(©) (1) 

where it has been assumed that the biasing field is parallel to the z-axis; B 0 = zB 0 , © is the angular 
frequency and e is the dielectric tensor. The plasma angular frequency, ©^ and the cyclotron 
angular frequency, © ^ are respectively defined for the k - particle of a general cold plasma as 
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where Z k e m k and n k denote the particle charge, mass and density, respectively. Hereafter we 
shall consider a two-particle plasma (ion and electron) for which the tensor components are 
reported in [5]. The products in (1) correspond to convolutions in time. Hunsberger et al. [4] 
extended die standard FDTD algorithm to include the discrete convolution of the electric field 
with a time-domain susceptibility tensor x = j=-/ , (/ the identity matrix), for the case of pure 

electron modes in a cold plasma. In such a case the magnetic field is updated as in the standard 
FDTD scheme, while the electric field components are updated according to the following 
equations: 
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where the components of % take into account the recursive convolution. As further generalization 

we considered a ^-variable biasing field Bq(z) and a non-uniform plasma with density variation 
on transversal cross-sections. This generalization requires plasma parameters to be averaged over 
each cell. 

According to previous investigations [6], a slotted-type Lisitano coil with N longitudinal 
slots can be regarded approximately as an array of N thin z-oriented slot antennas, of length W y 
uniformly distributed over the surface of a cylinder (Fig.2). A simple model to account for slot 
radiation was therefore an annular array of sinusoidal magnetic line currents radiating in front of 
short-circuited slots. 

Z 


z 0 +W- 


Fig.2: Magnetic current model of a Lisitano coil 

This model can be easily implemented in an FDTD computer code. To this purpose, the azimuthal 
electric field in the slot placed at 4> = § n is: 



„ (rr. > 4>« 0 = 7C J6(r - )5(<}> - 4> n ZQ<z<z 0 +W (4) 

where Zq is the coordinate of the slot down-end in the FDTD grid and i(t) accounts for the time- 
domain dependence (a sinusoid or a gaussian pulse). The equivalent magnetic line current will be 
therefore: 


m n ( r L >*/ > *.0 = “£* (%, 40 ( 5 ) 

Furthermore, r L A$ «X, where A(f> is the angular distance between consecutive axial slots. 
Therefore , when the reverse direction of the current in two adjacent slots is considered, we 
conclude that the currents radiate in phase for any n. The expression in (5) can be directly 
inserted in the H z update equation at points of the slots: 
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Jo.*)=-ffi>^, ) (A).yo.*)+w«(-Ro»<l>».*^.(» f + o.5)a/) 

'0 A* = r L cos(<t>„ ), j 0 Ay = r L sin{if „) 

where the subscript “ FDTD ” is for the corresponding FDTD equation in homogeneous media. 


in. Validation 

The accuracy of the modified FDTD algorithm in modeling cylindrical cavities filled by 
cold plasma can be discussed by means of a test case involving the computation of resonant 
frequencies according to the procedure described in [7] for magnetized ferrites. The reference 
data can be obtained by means of a closed form procedure. Starting from the Maxwell’s equations 
for gyrotropic media, it can be demonstrated [5] that the characteristic equation for the resonant 
modes of a cylindrical cavity of length L and radius Rq is: 

t? fr •A.0*0V (0*0)+ p3 m (0*0 ) - 

b ba n\ 

m 

J'(pRo)]+—eb <tr+i: VmipXoWqXo)) 

aq bu 

where -* 2 ,e*=-e^,e c =e^A 2 .p 2 ==a + x ; ? 2 =c-x; 

X = a = b = -Ja>\i 0 kk 0 e b tj ; c = -kl e, tj. 

d = j®e 0 kk 0 e b e s i]. q = (^-e|)/(* 2 e 2 - e* e c ). k 0 =o/c 0f kko^fmlL^ 

A is an integer and (7) is solved for k Q . If n denotes the n -th zero of (7) for given h and m , then 
®h»m= c 0*0 is the corresponding angular resonant frequency. The limit s^-M) corresponds to 
zeroing off-axis tensor components. It is easy to show that (7) reduces to 

J’ m (R (8) 


The roots of (8) correspond to TE and TM resonant modes, respectively, and can be found 
in closed form. In this work, therefore, the modes are first computed in the above limit, then they 
are tracked on the complex plane [8] when the value of the off-axis components is restored. 

Numerical resonant frequency are computed by FDTD [9] in combination with another 
technique which, starting from the time response, will provide the resonant frequencies of the 
structure. The cavity is excited by an electric z-directed current element located on z-axis [10]. 
Therefore only rotationally symmetric modes will be excited. Gaussian time-dependence for 
wide-band analysis is adopted. Since resonant frequencies for plasma cavities can be very densely 
distributed in bands close to the plasma characteristic frequencies, the standard procedure, 
involving the Fourier transform of field time-response and peak detection, may result inaccurate. 
We applied an algori thm for parametric spectral estimation in time domain. In fact the FDTD late¬ 
time response (after the excitation has died down) of an eigenvalue problem can be expressed as a 
superposition of resonant modes 


Ef = lR t e 

1=1 
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where Rj is complex modal amplitude, a t is damping factor and a, is resonant angular frequency 
of the i -th resonant mode and M is twice the number of resonant modes. Resonance parameters 
can be computed via a pole-residue extraction procedure such as the Matrix Pencil algorithm [11] 
which does not require a Fourier transform. The procedure can be repeated for different 
electromagnetic field components and at different observation points. Finally, resonant frequencies 
can be obtained by averaging these data. 

The results of Tab.I have been obtained by the FDTD and Matrix Pencil method for a cavity 
with L=7.3 cm, J^=5cm, filled with a plasma having electron density n 0 ~5 10 u m‘ 3 , B 0 =1200 G, 
v c =0.3GHz. For comparison, the corresponding resonant frequencies which are obtained by the 
closed-form method, are also reported, showing a good agreement between the outcomes of the 
two procedures. 


TABLE I 

Cavity resonant frequencies 


mode 

f [GHz] 

(analytical) 

f [GHz] 
(FDTD) 

Error (%) 

EI91B 

2.759 

2.782 

0.85 

EESEIIB 

3.150 

3.120 

0.95 

ESSEBH 

3.560 

3.550 


KBEEBB 

3.681 

3.656 

0.67 | 

E535SBB 

3.847 


0.78 | 

BSBE3M 

5.872 

5.894 

0.37 


IV. Numerical Example 

As a realistic example, a camera with L~0.6m, Ro=0.lm was considered. The Lisitano coil 
parameters were: N~ 8, r L =\4cm, W=6.\cm. The frequency was 2450 MHz. Plasma parameters 
were = 2.510 17 m -3 , v c = 24.5 MHz . The biasing magnetic induction was zero within the 
coil and decrease linearly from 950 Gauss to 750 Gauss outside the coil. The camera and the coil 
were meshed on a (5mm x5mmx 2mm) -cell grid resulting in 530.000 voxels. Fig.3 shows the 
computed electric field on a vertical cross-section of the camera. An important local maximum 
appears along the camera axis at about 10cm from the Lisitano coil termination. This peak is due 
to the plasma resonance and, as expected, the field is strongly dumped hereafter. The object to be 
processed could be placed in the hot-spot area where the field is almost uniform around the axis 
(Fig.4a). Finally, the electric field within the coil is shown in (Fig.4b). It can be observed that the 
field exhibits small variation on the cross-section (approximately 5dB). Moreover the picture is 
not exactly symmetrical for the coil staircasing in FDTD modeling. 


V. Conclusions 

The FDTD method has been used for electromagnetic field computations at microwave 
frequencies inside a cylindrical camera inhomogeneously filled by a magnetized collisional plasma 
and excited by a slotted circular array. 

To validate the numerical method, computations were performed on a simple structure, i.e., 
a cylindrical cavity filled by a uniform axially-magnetized plasma. Comparisons among resonant 
frequencies obtained in closed form and by means of FDTD and Matrix Pencil method show a 
good agreement and a relative difference less than 1 %. A realistic simulation of an ECR camera 
with an FDTD model of the Lisitano coil launcher has been successfully performed, showing the 
electric field hot-spot which can be associated to plasma resonance. 
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FigJ: |EJdB on a vertical camera cross-section 



'a) b) 


Fig.4: |E|dB on two transversal cross-sections through a) hot spot area and b) Lisitano coil 
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Abstract 

This paper presents an efficient simulation tool for microwave and hybrid heating. Computational results illustrate the 
usefulness for such a tool. A new thermal modeling technique for the simulation of conductive and radiant heat transfer is 
presented. The conductive heat transfer is modeled by a finite difference algorithm. A finite difference scheme is not 
applicable for the radiant heat transfer, as radiation from a material surface is not bounded to the immediate vicinity as is 
the conductive transfer. Therefore ray optical methods are used. Rays connecting mutually visible surfaces are obtained by a 
new fast method. The algorithms are combined with an electromagnetic FDTD program to simulate the electromagnetic 
fields within ovens. Simulations are presented for an oven heated by microwaves, or by a combination with a conventional 
heating. 

Introduction 

Thermal modeling is mandatoiy for the optimization of heating processes in ovens. Especially in combination with 
microwaves, the heating process has to be carefully designed to achieve a fast and uniform heating. Nowadays, most 
thermal models for ovens include only heat conduction or simplified models for heat radiation. This simplification neglects 
the transportation of heat radiation from one object to another and only the emission of heat from a surface. This becomes 
questionable at higher temperatures. In feet, with increasing temperature radiant heat exchange becomes more and more 
important, since the energy emitted from a material surface increases proportional to the fourth power of temperature. In 
contrast energy transported by heat conduction is only proportional to the temperature. Radiant heat transfer eventually 
prevails conductive heat transfer. If ovens are designed for higher temperatures radiant heat transfer is not neglectable and an 
accurate model has to be used for simulation. 

For the op timizat ion of microwave ovens one needs to determine the dynamic heat process. Hence, a combination cf 
thermal and electromagnetic simulation must be used. Both radiation and the influence of varying material parameters due 
to increasing temperature must be considered. 

The Finite Difference Time Domain (FDTD) method has been found to be an excellent algorithm for the calculation cf 
electromagnetic fields, especially in closed structures like ovens. Narrowband and broadband calculations are easily 
performed. The FDTD method applied to the heat conduction equation is similar to the electromagnetic FDTD. A 
combination with an electromagnetic FDTD leads to a very efficient and powerful simulation tool. Self-consistent 
modeling is possible as well as analyzing dynamic heating processes. 

Both, the conductive thermal and the electromagnetic algorithm are of local character as the temperature and the fields in 
one discretisation cell is only related to its neighboring cells. When considering radiant heat transfer, energy may be 
transported through the whole computational space. Mutually visible surfaces, view factors and material parameters have to 
be determined and used for the calculation. This usually means high computational effort A very fast algorithm has been 
developed that is optimized for calculation of radiant heat transfer. 
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The mechanism and the modeling of the conductive and radiant heat transfer is shown and combined with an 
electromagnetic FDTD algorithm. The resulting code is applied to a single mode oven where a polyvinyl chloride (PVC) 
sample is heated. As this material is likely to produce hot spots in an electromagnetic field the microwave heating is 
combined with conventionally heated walls. The heat distribution within the sample for the different heat sources is shown. 

Heat Conduction 

The finite difference scheme [1] is applied for the conductive heat transfer to a rectangular non-uniform grid with cells 
indexed by (i,j, k). Inside a cell the temperature is assumed to be homogeneous. Heat flux density vectors describing the 
heat exchange with the neighboring cells are defined in the middle of the six cell surfaces as shown in Fig. 1 for the 
discretisation cell (ij,k) 



Fig. 1: Discretisation cell (ij,k) and the position of the Fig. 2: Geometrical properties for the calculation of the 
scalar field components T and the vectors j discretised field component j x 

The calculation of the temperature distribution is divided into two steps. The conductive heat flux densities are caused by 
the temperature gradients between neighboring cells and given by 

J(x, t) = -cr(x, f)grad(r(i, t)) (1) 

with j the heat flux density, and a the thermal conductivity. Considering only the x component of j in Fig. 1 as shown 
in Fig. 2 the gradient in equation (1) is expressed using the finite difference expression 


The thermal conductivity at x is the average of two neighboring cells. With 
\ * o(i,j,k)o(i-\,j,k) 

aX '‘ ~ o(iJ,k)l2 + a{i -1 J,k)l t 


one obtains 
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■'* W ' o(iJ,k)l 2+ a(i-lj,k)lA K 1 K,J ’ ” 


( 4 ) 


The thermal conductivity is assumed to be constant over the volume of one discretisation cell. As the heat flux density is 
supposed to be constant over the surface, the power P x flowing through the interface between the cells and (i~lj,k) is 
determined by 

dA * j x (iJ,k)AyAz . (5) 

A 

All other components of j and P are obtained accordingly. 

The next step is the application of the conservation of energy. The change of temperature within a discretisation cell is 
derived from the change of energy. When the conservation of energy 


divy(x, T) + Cj(x,r)p(x, T) 




At 


■ 0 


( 6 ) 


is applied to one cell (i,j,k) the divergence of j equals the power that flows through the surfaces of the cell. c s and p are the 
specific heat capacity and the density, respectively. The divergence of j is expressed by the discretised surface integral in 
equation (5). This leads to 


divy(x) =* - p x (U’k) - P x (i +1 ,j,k) + P y (i,j,k) - P y (i,j +1,*) + P z {iJ,k) - P z (i,j,k + 1) 


( 7 ) 


in the finite difference scheme. 

The right hand side term in equation (6) applied to the cell volume leads to 


c s (x, T )p(*’ T) AT [ X '’^ =s> fffc s (x, T)p(x, dv = Ax A ytszc s ( i,j , k) p(i,j, k ) Ar ^’ ^ 

V 


At 


and finally the time derivative is also expressed by a finite difference 

dljx) T n *'(i,j,k)-T n (iJ,k) 
dt At 

where T denotes the temperature at timestep n. Using equations (7), (8) and (9) one obtains 




{‘,j,k) = T n (i,j,k)l 


At 


Ax Ay Az c s (i t j, k) p(i, j , k) 

[P x {i,j,k) - P x (i + l,y, k) + P y (iJ,k ) - P y (i,j +1,*) + P z {i,j,k)-P z (iJ,k +1)] 

With equations (10) and (5) the temperature is updated at each time step. 


( 8 ) 


( 9 ) 


( 10 ) 


Radiant Heat Transfer 

Radiant heat transfer applies only to surfaces. Additionally, radiant heat transfer is not bounded to the immediate vicinity, 
as energy may be transported through gases without any interaction. The main tasks of the simulation is to determine 
mutually visible surfaces and geometric parameters. The algorithm is therefore quite different to the algorithm for the heat 
conduction above. 

With increasing time in the simulation the increase of temperature may cause a variation of the electric and thermal 
properties, but the geometry remains constant. Therefore geometry checks are performed only once before calculation starts. 
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The checks include the determination of radiant surfaces, testing for visibility and the calculation of view factors. 



Fig. 3: Radiation between two rectangular planes. 

The radiant heat transfer only applies to interfeces between solid and gaseous materials. Therefore the first step is to 
determine those surfaces within the finite difference grid. The surfaces of the solids are assumed to be rough. The Stefan- 
Boltzmann and Lambert equations determine for this case the specific heat radiation. In closed environments like ovens, all 
the radiated energy is absorbed by other surfaces. For the calculation of the radiant heat all energy exchanges between 
surfaces have to be calculated. As shown in Fig. 3 the radiation of surface Ai is partially absorbed by surface Aj. The 
absorbing surface Aj, however, also emits thermal radiation. A part of this radiation is in turn absorbed by Ai. The power 
exchanged between the two surfaces / and j is 

02 ) 


where 


Ei a j F}j ct, 7 f 
c j a ‘ Fji a j Tj 


(13) 


where a, and e, are the absorption and emission coefficient of surface A„ respectively. Kirchhoffs law states that a, equals 
Ei. The view factors Fy and Fjt are coefficients detennined by the geometry only. They express the part of energy that is 
transmitted by surface A,- and actually received by surface Aj, divided by the total energy emitted by surface A,-. The view 
factors Fij are given by 





dAjdAi 


(14) 


and are determined by geometrical properties as shown in Fig. 3. The factors have values between 0 for infinitesimally 
small surfaces infinitely separated and 1 for two parallel, infinitely extended planes. The F tj are constant with respect to 
temperature. 


The final temperature distribution is detennined by the energy exchange between all surfaces. The radiant power flowing 
through the surfaces of cell (ij,k) is now used to calculate the variation of the temperature. When applying the conservation 
of energy, equation (6), to the radiant heat transfer one finally obtains 


T n+l (i,j,k)=T n {i,j,k) + 


_Ar_ 

AxAyAzc 5 (ij, k) p(i,j , k) 


2 Prad 

Surface 


(15) 


Implementation 

For maximum versatility, the simulations are based on a cartesian, but non-uniform grid. This allows an enormous 
simplification for the determination of view factors. An efficient calculation is obtained if the geometrical properties are 
checked and the view factors are calculated and stored only once before the calculation starts. During the calculation the 
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stored values are simply processed in a look-up table. 

The preprocessing is divided into two steps. First the relevant cells are determined. Only cells representing solid materials 
bordered by a gas or vacuum are relevant as only those cells radiate or absorb energy. 

The next step is to determine surface pairs. Two surfaces are called a surface pair when they exchange energy. Energy 
exchanges only apply to surfaces that are mutually visible. Eveiy possible combination of surfaces has to be checked 
whether they are linked by a line of sight. Depending on the size and properties of the oven and the discretisation millions 
of possible surface pairs exist Hence, the visibility check has to be very efficient. 

Here a modified Bresenham algorithm is used which combines versatility and efficiency. This algorithm determines very 
fast the pixels that are penetrated by a line [2,3]. However, this algorithm has been developed for two dimensional uniform 
grids. Here the algorithm is extended to three dimensional non-uniform grids. 

For each surface pair the view factors Fy are calculated. Using the reciprocity 

A t Fij = Aj Fji (16) 

the view factors are calculated only once for each surface pair. As the surfaces are either parallel or perpendicular oriented in 
the rectangular grid, the integrals in equation (13) can be simplified. Hsu [4] showed a method for evaluating the view 
factor without integration making the calculation much faster. 


Microwave Heating 

The heating of materials by microwaves is determined directly using 

( 17 ) 

v 

where Pdt c is the microwave power dissipated. As shown in Fig. 4 there are twelve electric field components dissipating 
power within one cell. The electromagnetic power 

„ _ AxAyAzv* If |2 rial 




is determined by the conductivity k of the material . Combining (17) and (18) and setting up the finite difference scheme 
one obtains 




The computation of the electric energy dissipated in one cell is done by a standard FDTD scheme, and includes linear and 
frequency-dependent materials. 

Additionally, equations (10), (15) and (19) are combined to update the temperature in only one computational step. The 
radiant, conductive and electric power is summed up leading to 


T n *\iJ,k)=T%j,k)i 


Ax Ay Az c s (ij, k) p(i,j, k ) 


^ ^rad + ^ ^cond + ^ ^clec 

Surface Surface Volume 


Y _3 

Fig. 4: Placement of the electric field components within the discretisation cell (ij,k). 

Self consistent Simulation 

In the previous sections all material parameters are assumed to be constant over temperature. However, for most materials 
this assumption is not valid. The variation of material parameters like electric or heat conductivity is veiy important for the 
heat process. To obtain a realistic simulation of a heat process the variation has to be included. 

The variation of material parameters is slow in comparison to the variation of the electromagnetic field. In most cases it is 
not necessary to adapt the material parameters after eveiy calculation step. A good way to include variant parameters and 
restrict computational effort is to adapt parameters only if temperature has changed significantly. 

The combination of electromagnetic and thermal simulation can be done in two different ways: 

* Direct Coupling of the two methods: Thermal variations are much slower than electromagnetic ones. Therefore a direct 
coupling would lead to enormous computational effort, as calculations would have to proceed very long to get 
temperature variations. This can be avoided if the simulations use different time steps. For the electromagnetic 
simulations a rather small time step is needed to fulfill the stability criterion for this algorithm. The timestep for the 
thermal FDTD code may be much greater. For e.g. simulations at 2.45 GHz the timestep for the electromagnetic code 
is approximately 10 ps. For thermal simulations the timestep is in the region between milliseconds and seconds. 
Using different timesteps causes an unphysical acceleration of the thermal processes and may lead to inaccurate results. 
Therefore this technique has to be handled with care. 

• Separation of the two methods: This means electromagnetic and thermal calculations are done separately. First the 
electromagnetic field is calculated and assumed to be constant for the thermal simulation started afterwards. When 
temperature and therefore parameters change significantly, the thermal simulation is interrupted and the electromagnetic 
field is determined again with the adapted parameters. 

Both of these methods involve advantages and disadvantages and for a given problem only one method may be applicable. 
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Investigation of Heating Processes 

To show the applicability of this method a small single mode oven is used. In this oven a polyvinyl chloride (PVC) 
sample is heated. Fig. 5 shows the oven and the placement of the sample. PVC has a very low thermal and electric 
conductivity. Additionally, the electric conductivity is temperature dependent and increases with temperature. In 
microwave ovens thermal hot spots arise in such materials. Areas within those materials with higher electromagnetic 
fieldstrength become warmer and losses increase. This causes a higher energy absorption in those areas and the temperature 
increases faster. The thermal conductivity is too low to compensate this effect. 

Microwave Feed 



Fig. 5: Single Mode Applicator for heating PVC samples 

Fig. 6 shows the calculated electric field distribution in a ] 
that a standing wave arises causing a strong maximum 
distribution within the sample one can see the resulting ho 
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to the cooler surface and compensates the missing absorption of radiated heat by that wall. The results are shown in Fig. 9. 



Fig. 8: Temperature Distribution -within the sample if the Fig. 9: Temperature Distribution within the sample. The 
walls are heated conventionally (t~60 s). sample is placed 10 mm away from the front wal 

(t-60 s)l. 

Conclusion 

A new method was presented to simulate heating processes including conduction and radiation. The algorithms are 
implemented in an electromagnetic FDTD-code, allowing the calculation of microwave and conventionally heated ovens. 
For the conductive heat transfer a finite difference scheme is shown. Ray optical methods are introduced to check visibility 
of surfaces and consequently the possibility of radiant heat exchange. The resulting simulation tool is very powerful and 
helpful to design ovens. As shown in an example microwave ovens cause hot spots in materials with low thermal 
conductivity and electric losses that increase with temperature. In hybrid ovens microwave and conventional heat sources 
can be adjusted and hot spots can be minimiz ed. Temperature distributions can be determined inside a material to optimise 
the heating pattern. As a determination by measurements needs much more effort and is difficult to realise, the program is 
an excellent tool to perform heating adjustments and offers an excellent opportunity to simulate heating processes. 
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Abstract: In this paper, the authors present a new and robust conformal Finite Difference Time 
Domain (FDTD) algorithm for the accurate modeling of perfectly conducting objects with curved 
surfaces and edges. We illustrate the application of this approach by analyzing a number of 
representative antenna and cavity problems. These include a quarter wave monopole mounted on a 
perfectly conducting elliptic disk, a circular patch antenna, and a cylindrical cavity. We validate the 
method by comparing the results for the pattern, impedance and resonant frequency, etc., with those 
derived by using other techniques. 


I. INTRODUCTION 

The finite difference time domain method (FDTD), though it has been widely used to model 
various electromagnetic phenomena [1,2,3,4], continues to present a challenge when used to analyze 
structures with curved surfaces and edges. Although, in the past, there have been many attempts to 
address this problem via the use of conformal FDTD methods [2-19], the problems of mesh generation 
and instabilities have continued to plague most of these approaches. In this paper we present a 
generalization of the conformal FDTD method, which substantially preserves the conventional FDTD 
update algorithm as introduced by Yee [1]. We show, through several examples, that the present 
CFDTD approach is accurate, numerically efficient, and stable. 

We begin by providing a brief review of the various techniques that have been employed for 
analyzing objects with curved surfaces. These include: 

(i) Methods based on a globally-distorted and body-fitted grid model 

These methods use non-Cartesian grids that conform to smoothly-shaped structures [6,7], and are 
limited in their application to special classes of geometries. 

(ii) Hybrid methods 

These approaches combine the FDTD method with other numerical methods, such as the Finite 
Element Method (FEM) [8,9] and the Method of Moments (MoM) [10]. In these approaches, the 
problem of interfacing the FDTD regions with the MoM or FEM domains in a manner such that 
spurious reflections are minimized and the solution is stable, is a challenging one and must be 
addressed to obtain satisfactory results. 
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(iii) Locally distorted grid models 

These methods have the advantage of preserving the basic Cartesian nature of the grid in the entire 
computational domain [11-17]. However, they frequently suffer from the problems of late time 
instability and difficulties with mesh generation. 

II. THE CONFORMAL FDTD APPROACH 

Let us now describe briefly the conformal FDTD technique presented in this paper. It begins by 
dividing the cells, that are partially-filled with metallic conductors (assumed to be PECs), into two 
parts, viz., the inside and outside subregions (see Fig. 1). The E-fields on the edges of the inside 
subregions are set to zero, whereas they are updated by using a modified algorithm in the outside 
subregions. 

The magnetic fields inside the partially-filled cells are updated by using a slightly modified 
form of the conventional FDTD algorithm which accounts for the deformation of the cell. Unlike past 
approaches, however, we no longer employ the deformed cell area for updating the H-field-but use 
the entire cell area instead (see Fig. 2)--and this serves to eliminate the instability problems 
experienced in the past. The magnetic field component H y is written as 




At 




Az(i, j, k)E" (i, j,k)~ Az(; -1, j, k)E" (i -1, j, k) 
dx(i)x dz(k) 

( Ax(), j, k - l)g; (/, j, k- 1) - Ax(/, J, k)E" x fc j, k) 
<&(/)x dz(k) 


( 1 ) 


In the next section we present the numerical results for a number of representative problems to 
illustrate the application of the method just described above. 




Fig.2 Intersection between the FDTD mesh and a PEC 
in the x-z plane 
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III. NUMERICAL RESULTS 


To illustrate the versatility of the modified Finite Difference Time Domain program described 
above, referred to herein as the CFDTD code, we now present the numerical results for three different 
problems. They include: (i) the computation of the far-zone pattern for a quarter-wave monopole 
mounted on a perfectly conducting elliptic disk; (ii) evaluation of the impedance of a circular patch 
antenna; and, (iii) computation of resonant frequencies for a cylindrical cavity. In all of these cases, a 
10-layer unsplit PML is used to truncate the FDTD domain, and the time step is chosen to be 


At = 


0.995 



( 2 ) 


3.1. Monopole on elliptic and circular ground planes 

First, we consider a quarter-wave monopole, mounted on a perfectly conducting elliptic disk, as 
shown in Fig. 3. The accuracy of the fields in the shadow region of the monopole depends on our 
ability to model the ground plane accurately. 

The major and minor axes of the elliptic disk (b and a) are assumed to be 1.7 and 1.5 meters, 
respectively. The excitation source is a Gaussian pulse with a 3 dB cutoff frequency of 300 MHz, 
modulated by a sine function. The antenna is excited at the gap between the monopole and the ground 
plane. The FDTD domain is discretized into 79 x 87 x 30 cells, with a uniform mesh, whose 
dimensions are: dx(i) = 0.05 m, dy(j) = 0.05 m and dz(k) - 0.05 m. The normalized far zone field 
patterns for <j> = 0° and § = 90° at a frequency of 300 MHz are shown in Fig. 4 and Fig. 5, respectively. 
We observe that the CFDTD results are in good agreement with those obtained by using the MoM 
technique for both cuts. 

Next we change the disk shape to be a circular one with a radius of 0.62 m, and use a non- 
uniform version of the CFDTD to calculate the far zone field pattern of the same monopole antenna at 
1 GHz (see Fig. 6). We observe, once again, that the conformal FDTD results are in good agreement 
with those reported elsewhere in [21,22]. 

3.2 Circular patch antenna 

Next, we compute the impedance of a microstrip patch antenna, circular in shape, as shown in 
Fig. 7. The computational domain for this problem is subdivided into 55 x 44 x 10 cells. The cell 
dimensions are chosen to be 0.002 m, 0.002 m and 0.000795 m , in the x, y and z directions, 
respectively. The simulation is run for 25,000 time steps (though 15,000 were found to be adequate), 
with a time step of At = 2.29847 ps, and the solution is found to be entirely stable. The impedance of 
the circular patch antenna versus frequency is shown in Fig. 8. The results reported in [2,18,19] via the 
application of the symmetric and asymmetric PGY approaches, and those obtained by using the MoM 
[22], are also plotted in the same figure for the sake of comparison. The CFDTD results are seen to 
compare quite favorably with that obtained via the MoM; however, both the PGY (symmetric) and 
PGY (non-symmetric) methods exhibit a slight downward shift in the resonant frequency. We also 
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point out that in the PGY simulations a time step of At = 0.725 ps [2] was used, which is one-third of 
that needed in the CFDTD simulations. 


3.3 Cylindrical cavities 

As a final example, we turn to the problem of computing the resonant frequencies of the 
dominant TE and TM modes of a circularly-cylindrical cavity. The dimensions of the cylindrical 
cavities and results are presented in Table 1, along with the dimensions of the cavity, with R and H 
denoting the radius and height, respectively. The spatial discretization for this problem was chosen to 
be 0.005 m , and 5000 time steps were used to derive the results. Once again, we observe very good 
agreement between the new CFDTD results and the analytical solutions. In addition, no instabilities 
were observed, for this closed region problem, even though the program was tested up to 40,000 time 
steps. 


TABLE 1. Resonant Frequencies of Cylindrical Cavities 



Methods 

Geometry I 
RxH- 

(O.lmxO.lm) 

Geometry II 
RxH= 

(0.1mx0.08m) 

Geometry III 
RxH= 

(0.1mx0.06m) 

Geometry IV 
RxH= 

(0.06mx0.06m) 

TEin 

CFDTD 

1.734 

2.062 

2.639 

3.133 

Analytic 

1.738 

2.071 

2.650 

3.148 

Error 

0.23% 

0.43% 

0.42% 

0.48% 

TMon 

CFDTD 

1.88 

2.194 

2.742 

2.872 

Analytic 

1.889 

2.199 

2.751 

2.897 

Error 

0.48% 

0.23% 

0.33% 

0.86% 



Fig. 3 Geometry for the radiation from a X/4 
monopole on an elliptic disk 
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Fig.4 Normalized total electric field pattern for a 
A/4 monopole on an elliptic disk with a=l .5 m and 
b-1.7 m at a frequency of 300 MHz. 


Phi=90 degrees 



Fig.5 Normalized total electric field pattern for a 
A/4 monopole on an elliptic disk with a=1.5 m 
and b=l .7 m at a frequency of 300 MHz. 


270 



- CFDTD • • • MOM 

Fig. 6 Normalized total electric field pattern for a A/4 monopole on a circular disk 
with a=b= 0.61 m at a frequency of 1 GHz. 
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Fig. 7 Microwave coupled circular patch antenna. 
The microstrip is printed on a 1.59 mm substrate of 
Sf = 2.62 over a ground plane, and the patch 
antenna is printed above a 1.59 mm superstrate of 
s r = 2.62. (a) top view; (b) side view. 



Fig. 8 Comparison of the reflection loss for a microstrip 
coupled circular patch antenna computed using CFDTD, 
MoM, and PGY. 


REFERENCES; 

[1] K. S. Yee, “Numerical Solution of Initial Boundary Value Problems Involving Maxwell’s 
Equations in Isotropic Media,” IEEE Trans. Antennas and Propagate vol. AP-14, pp. 302-307, 
May 1966. 

[2] A. Taflove (editor), Advances in Computational Electromagnetics: The Finite-Difference Time- 
Domain Method , Boston, MA: Artech House, 1998. 

[3] T. Itoh (Editor), Time-Domain Methods for Microwave Structures , IEEE Press, 1998. 

[4] A. Taflove, Computational Electromagnetics: The Finite-Difference Time-Domain Method , 
Boston, MA: Artech House, 1995. 

[5] J. P. Berenger, “A Perfectly Matched Layer for the Absorption of Electromagnetic Waves,” J. 
Comput. Phys ., vol. 114, pp. 185-200, 1994. 

[6] M. Fusco, “FDTD Algorithm in Curvilinear Coordinates,” IEEE Trans. Antennas and Propagat ., 
vol. AP-38, no.l, pp. 76-89, January 1990. 

[7] J-F. Lee, R. Palandech, and R. Mittra, “Modeling Three-Dimensional Discontinuities in 
Waveguides Using a Nonorthogonal FDTD Algorithm,” IEEE Trans. Antennas and Propagat 
vol. AP-40, no.2, pp. 346-352, February 1992. 

[8] K. S. Yee and J. S. Chen, “The Finite-Difference Time-Domain (FDTD) and the Finite-Volume 
Time-Domain (FVTD) Methods in Solving Maxwell’s Equations,” ” IEEE Trans. Antennas and 
Propagat ., vol. AP-45, no.3, pp, 355-363, March 1997. 

[9] Monorchio, A., and R. Mittra, “Time-Domain (FE/FDTD) Technique for Solving Complex 
Electromagnetic Problems,” IEEE Microwave and Guided Wave Letters, vol. 8, no.2, pp. 93-95, 
February 1998. 

[10] A. R. Bretones, R. Mittra, and R. G. Martin, “A Hybrid Technique Combining the Method of 
Moments in the Time Domain and FDTD,” IEEE Microwave and Guided Wave Letters, vol. 8, 
no.8, pp. 281-283, August 1998. 


949 
































[11] T. G. Jurgens, A. Taflove, K. Umashankar, and T. G. Moore, ‘‘Finite-Difference Time-Domain 
Modeling of Curved Surfaces/' IEEE Trans. Antennas and Propagat., vol. AP-40, no. 4, pp. 
357-366, April 1992. 

[12] T. G. Jurgens, and A. Taflove, “Three-Dimensional Contour FDTD Modeling of Scattering from 
Single and Multiple Objects,” IEEE Trans. Antennas and Propagat ., vol. AP-41, no. 12, pp. 
1703-1708, December 1993. 

[13] J. B. Schneider, and K. L, Shlager, “FDTD Simulations of TEM Homs and the Implications for 
Staircased Representations,” IEEE Trans. Antennas and Propagat ., vol. AP-45, no. 12, pp. 1830- 
1838, December 1997. 

[14] Y. Nishioka, O. Maeshima, T. Uno, and S. Adachi, “FDTD Analysis of Resistive-Loaded Bow- 
Tie Antennas Covered with Ferrite-Coated Conducting Cavity for Subsurface Radar,” IEEE 
Trans. Antennas and Propagat ., vol. AP-45, no. 12, pp. 970-977, December 1997. 

[15] C. J. Railton, and J. B. Schneider, “An Analytic and Numerical Analysis of Several Locally 
Conformal FDTD Schemes,” IEEE Trans. Microwave Theory Tech., vol. 47, no.l, pp. 56-66, 
Jan. 1999. 

[16] S. Dey and R. Mittra, “A Locally Conformal Finite Difference Time Domain (FDTD) Algorithm 
for Modeling 3-D Objects with Curved Surfaces,” IEEE AP-S International Symposium , 
Montreal, Canada, 1997. 

[17] S. Dey and R. Mittra, “A Locally Conformal Finite Difference Time Domain (FDTD) Algorithm 
for Modeling Three-Dimensional Perfectly Conducting Objects,” IEEE Microwave and Guided 
Wave Letters , vol.7, no.9, pp. 273-275, September 1997. 

[18] S. Gedney, F. Lansing, “A parallel Plane Generalized Yee Algorithm for the Analysis of 
Microwave Circuit Devices,” J. Numerical Modeling, (Electric Networks and Fields), vol. 8, 
1995, pp. 265-276. 

[19] S. Gedney, F. Lansing, and D. Rescoe, “A Full Wave Analysis of Passive Monolithic Integrated 
Circuit Devices Using a Generalized Yee Algorithm,” IEEE Trans. Microwave Theory Tech, 
vol. 44, 1996, pp. 1393-1400. 

[20] Wenhua Yu, and R. Mittra, “A Technique for Improving the Accuracy of the Non-uniform Finite 
Difference Time Domain (FDTD) Algorithm,” IEEE Trans. Microwave Theory Tech., vol. 47, 
no. 3, pp. 353-356, March 1999. 

[21] P. A. Tirkas and C. A. Balanis, “Finite-Difference Time-Domain Method for Antenna 
Radiation,” IEEE Trans. Antennas and Propagat., vol. AP-40, no.3, pp. 334-340, March 1992. 

[22] C. A. Balanis, Advanced Engineering Electromagnetics, John Wiley & Sons, New York, 1989, 
chapters 13 and 14. 

[23] Zeland Software, 39120 Argonaut Way, Suite 499, Freemont, CA 94538. 


950 






A finite-difference algorithm for modeling of conductive wedges in 2D 

Piotr Przybyszewski* 


Abstract 

This paper presents a fast finite-difference approach to the analysis of 2D structures containing conductive 
wedges. Such wedges may lead to singularity of some of the field components causing the local truncation error 
of the standard finite-difference methods to be unbounded and leading to high inaccuracy of the global results. 
The new method is expressed in the form of modification of two simple basic matrices. Such an approach gives 
a common method for discretization of various electromagnetic problems, since the basic operators allow one to 
represent Maxwell’s equations. The new algorithm for modeling of conductive wedges supports inhomogeneous 
domains and unlike other algorithms found in the literature, makes no assumption about positioning of the edge 
with respect to Yee’s cell. The numerical tests performed to validate the new approach give very good results 
in all range of parameters. 


1 Introduction 

Finite difference techniques using Yee’s mesh are widely used for solving electromagnetic problems. However, one 
of the assumptions of these methods is that the variation of field within Yee’s cell is not significantly different from 
linear. This condition is not satisfied in the vicinity of conductive wedges, where some of the field components 
may become singular [1]. This causes the local truncation error to be unbounded and leads to high inaccuracy of 
the global results. Various solutions of this problem were published in the literature. However, all of them have 
limitations. They assume the central location of the edge within Yee’s cell [2] [3] and/or support only wedges placed 
in the homogeneous domain [4]. In this paper, we present a new technique which does not have these limitations. 
The new approach gives significant reduction of error associated with the singularity for an arbitrary location of 
the wedge within the cell. Moreover, the domain in the vicinity of the wedge may be inhomogeneous. The new 
methods are implemented in the form of modification of basic matrices. All these matrices arise from discretization 
of basic operators, such as transverse gradient V*(.), transverse divergence V* ■ (.) or operator i z x (.). Such an 
approach gives a common method for discretization of every differential problem in 2D electromagnetics, since the 
basic operators are directly related to Maxwell’s equations. This method of discretization is similar to the one 
proposed in [5] but uses different basic operators. 


2 Formulation of 2D electromagnetic problems 


In this paper, we assume that the analyzed structure is lossless, isotropic and uniform in direction z. We decompose 
the fields into transverse (indiced with t) and longitudinal (indiced with z) components. Maxwell’s equations may 
be written as follows. 


I" —jfiz'Z'm 
L —DteZ m 
[ -tfz Ze 

[ ~D tm Z e 


■Z m Gt m 1 


' ht ‘ 

0 \ 


! h z 

— Z e G te 


r e t ' 

0 


U j 


Ett 0 €t 

0 E* z _ & z 


* Mtt o' 

’ fH * 

0 

. h * . 


( 1 ) 

( 2 ) 


p z is the propagation constant, w is the angular frequency, e and h denote, respectively, electric and magnetic 
intensity fields. All the basic operators are written in a symbolic form. They have dual forms depending on 
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the fields they are acting on. This allow one to introduce the boundary conditions to the operators and treat 
the boundary value problem in a uniform way. This gives us a common approach for the operator equations 
in continuous space and matrix equations in discrete space. The following table presents relations between the 
symbols, continuous operators and matrices: 


Continuous space 

v.-O _ 

V,(.) 

Z z X (.) 

ht 

*zz 


U’ZZ 

Symbol 

D te 

Utm 

Gte 

Gt m 

Z e 

z m 

Ett 

E« 

Mtt 

m tz 

Discrete space 

Q-tt 

EL. tm 

&te 

Q. tm 



Ett 

gzz 

Ktt 

Kzz 


2.1 Properties of basic operators 

Analysis of basic operators in continuous space in eqs. (1) and (2) leads, for lossless structures and nonradiative 
modes, to the following properties: 

Dt. = -G£ (3) 

D tm = -GL (4) 

Ze = -Z-> = -Z£ (5) 

In the lossless domain, the operators associated with material tensors should be hermitian symmetric: 

E« = E» E„ = B® M« =Mg M« = M® (6) 

Furthermore, from the property V • V x (.) = 0, we get: 

DnnZeGte = 0 (7) 

D te Z m G tm = 0 (8) 

If the domain is isotropic and homogeneous, we have V 2 = (e/i)~ ! , where v is the wave speed. This implies: 

E t - t 1 Z m M^ l Z c = u 2 I tt (9) 

where I tt is the identity operator defined on the transverse fields. This condition guarantees that the propagation 
constant and the frequency are related to each other by the following equation: 

u 2 - sr v 2 p 2 (10) 

where o?o is the cutoff angular frequency of the corresponding mode. 

All the properties presented above may be seen as constraints which have to be fulfilled by the discrete operators. 
However, most of them will be satisfied in a natural way, as a consequence of the field behavior and properties of 
Maxwell’s equations and Yee’s grid. 

2.2 Electromagnetic eigenproblems 

Various eigenproblems may be formulated based on eqs. (1), (2). Here, we point out only a few examples, fi 2 
formulation for the transverse electric intensity field components may be written as follows: 

/?fef = — a; 2 Z m MttZ e Ettet — ZmMttGtmM^DtmZee* 

+ GteE-'DteEtte* (11) 

At cutoff, we get two scalar eigenproblems dealing with, respectively, modes E and H: 

D te Z m M^ Z e G te c, = u 2 e z (12) 

M' 1 Dt m ZeE t " t 1 Z m G tm h z = u 2 h z (13) 

Analogously, we may write two scalar eigenproblems for the static case. By discretizing separately the basic 
operators rather than full equations, we get a common approach to the discretization of eigenproblems including 
the ones presented above. 
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Figure 1: Fragments of 2D Yee’s grid corresponding to the domains and contradomains of matrices 22 £e (a) and 

Zt o>). 


3 Discretization of basic operators 


Fragment of matrix 22 £c corresponding to nodes in fig. 1(a) may be written as follows: 




i i 
Ay Ax 


1 1 1 
”2* “Si J 


(14) 


In a similar manner, we may define matrix 22tm- Matrices corresponding to transverse gradient, have the form 
Qte — —Qje satisfying conditions (3) and (4). Fragments of matrices J[ e , corresponding 

to the nodes in fig. 1(b) have the following form: 




10 0 0 
0-100 
0 0 10 
0 0 0 -1 


-10 0 0 

0 10 0 

0 0-10 
0 0 0 1 


(15) 


Material matrices corresponding to the material tensors are in the homogeneous parts of the domain simply the 
values of the appropriate material constant at the corresponding point. 

Further on, we define local algorithms for modeling of conductive wedges, arbitrarily located electric walls and 
boundaries between dielectrics. All these algorithms are written in the form of modification of material matrices. 
Such an approach guarantees the right interpretation of the fields. 


3.1 Conductive wedges 

Conductive wedges may lead to singularity of some of the field components in the vicinity of the edge. This cause 
the local truncation error to be unbounded and leads to high inaccuracy of the global results. Here, we present the 
new algorithm for modeling of such structures reducing this kind of errors. A general idea is to modify matrices 
Qu, M.tt so that the field behavior is correctly represented by the standard grid equations. 

General algorithm based on cutoff solutions We expand the longitudinal field components in the region of 
singularity into a series of functions: 

e z = 'y ] Cei&zci h z — ^ ^ Chi^xci ( 15 ) 

i i 

where c e », Chi are unknown coefficients. Functions e z «, h zc i are the solutions of cutoff scalar equations, respectively 
(12) and (13), for cj = 0 in the vicinity of the singularity. Let us concentrate on field e z . Limiting the sum in (16) 
to the first few terms and writing it for a few grid points e z we may present it in the following matrix form: 

— S.ecQe ( 12 ) 

Further on, we assume, that the number of elements in the sum is equal to the number of grid points, and hence 
22 ec is a square matrix. We also assume, that the location of these points does not lead to singularity of the matrix. 
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Figure 2: Conductive wedge in an inhomogeneous domain: general case (a), wedge symmetrically placed between 
two dielectrics (b) } nodes used in the correction algorithm for the wedge of angle a = 0 at 4> o = 180° (c). 

Element ij of matrix R ec is equal to the value of function e ZC i of series (16) at grid point e z y. 

Becij ~ &zci\ tz . (18) 

Let us now write Maxwell’s cutoff equation for field b t in terms of field e z : 

—jicbt — —Z_ e Q_t e e z (19) 

Inserting eq. (17) into (19) and inverting the operator in order to express vector c e in terms of vector 6*, we get: 

Ce = -jui&QLZmkt (20) 

Operation (.) J is a quasi-inversion and will be discussed in one of the next paragraphs. At this stage of discussion, 
we may assume that it is equivalent to inversion. Vector ht may be expressed in terms of c t as follows: 

—jttht — G.tmA.ttcQc ( 21 ) 

Matrix Ate in this equation converts coefficients c ei into indefinite integrals. Further on, matrix g* m converts 
them to the definite integrals. Element ij of matrix A ec is the value of appropriate indefinite integral of function 
e ZC i at grid point h z y. 

Atcij ~ I W-Czcidy =s — I ~—~e zc idx 
J Hzxdx \ haj J Hyydy 

Putting eq. (20) into (21), we get: 

kt = £tmA"!l:!glg m b t (23) 

The above equation shows relation between fields ht and 6*. This defines operator M^ 1 : 

Mu' = gtmA'cRZ'ateZm (24) 

In the same manner, we construct matrix E_t t l : 

I« = -guAhcR^aLZ' (25) 

Analogous procedure may be performed when constructing the basis from static equations for /?* = 0. This leads 
to definitions of matrices g_tt and M t.t, which are equivalent to the equations given above. 

Symmetrization of matrices and Mtt We should note, that operators ( or J£tt, Met) defined 

this way are not symmetric, and hence do not satisfy condition (6). Fortunately, the algorithm is not sensitive 
to small deformations of Yee’s cells in the vicinity of the wedge. Therefore, we may choose the locations of the 
appropriate nodes in the way leading to symmetric operators. This may be also achieved in a simpler way, where 
we sy mm etrize nonsymmetric matrices as follows: 

g<-0.5 (g+£ T ) (26) 

where Q_ denotes the operator to be symmetrized. In order to not disturb condition (9) for homogeneous domains, 
we should sy mm etrize only one of the pairs MSu or M t u j=jtt * 
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Quasi-inversion of matrices Q_.teiG.tm Expressions describing matrices Ru,M.tt include inversions of matrices 
Q_tc and G_tm- Unfortunately, in many cases, these matrices cannot be inverted, because they are singular. Let 
us note, however, that zero eigenvalue of these operators corresponds to one of the static modes t z \ = const or 
h Z i = const. These modes do not have the transverse field components, and hence have no influence on matrices 
Eft and M ++. Therefore, we may assume that these modes do not exist in the global field. In this case we may 
remove zero eigenvalue as follows: 

a'tm^gtm+htohl (27) 

where h z o is nonzero eigenvector corresponding to zero eigenvalue, and h* 0 is an arbitrary nonzero vector in space 
{/if}. Operator is not singular and may be inverted. We call operator 2tin 1 the quasi-inversion of Q_tm 

(gL= gi?). 

Conductive wedge When a conductive wedge is the source of singularity (fig. 2(a)), series (16) takes up the 
form [1]: 

oo oo 

e z = £ a v c * rVi 8in(vi<f>) h * = ^2 othjChir Vi cosfafy (28) 

i=l i=0 

where a e j, a^j , V{ have to be chosen in a way satisfying all boundary conditions. The following table presents 
definition of basic operators for the conductive wedge placed symmetrically between two dielectrics (figs. 2(b,c)). 
One may note, that this includes a very common case of a thin metal strip placed on a dielectric boundary, which 
may be seen as a conductive wedge of angle a — 0. 



3.2 Modeling of boundaries 

Boundaries between dielectrics or magnetics and electric and magnetic walls may be modeled by setting appropriate 
effective permittivities and/or permeabilities on the diagonal elements of material arrays corresponding to nodes 
close to the boundary. The equations for the effective material constants are based on the integral interpretation 
of the finite difference methods and are given below without derivation. These equations are needed to perform 
the tests presented in sec. 4, Since, in this paper, we want to concentrate on modeling of conductive wedges, the 
reader may skip the rest of this section. 

Boundary between dielectrics Fig. 3(a-c) presents three cases of location of dielectric boundary with respect 
to the electric field within Yee’s cell. When the electric field is parallel to the boundary (fig. 3(a)), the effective 
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Figure 3: Modeling of boundaries. Electric field parallel (a), normal (b) and at an arbitrary angle with respect to 
the dtelectnc boundary (c). Electric wall crossing 2D Yee’s grid. Nodes H x , E x , E y (d) and E Zt H x , H v (e). 


permittivity may be calculated from the following equation: 

€l 5 i + €2*^2 

*11 =- 5 - 

In the case of the field normal to the boundary (fig. 3(b)), we get the following formula: 


(29) 


S 

ii. _l 
*1 «2 


(30) 


In a more general case, when electric vector is placed at an arbitrary angle with respect to the boundary (fie 
the effective permittivity may be expressed by €jj and cjl: 


3(c)), 


= €j_ sin 2 <f> + e|| cos 2 (f> ( 31 ) 

where (f> is the angle between the boundary and x field component. 

In a similar manner, we may express the effective permeability g. t g at the boundary between two magnetics. 

Electric walls Looking at Yee’s cells crossed by an electric wall in fig. 3(d,e), we may note, that they are 
eformated with respect to the standard cells. This means, that there appear different integration paths and 
surfaces than m the standard case in the integral interpretation of the finite difference methods. This difference 
may be carried out by introducing appropriate effective permittivities and permeabilities at the cells close to the 
boundary. We may note, that there are three types of cells crossed by electric plane, namely A, B and C in 
fig. 3(d e). Node D is of the same type as node A, node E is a standard Yee’s cell which requires no modification 
and cell F corresponds to cell B. The effective material constants for all three types of cells are presented in the 
following table: 


Node 

Nodes H z ,E x ,E y 

Nodes E Z ,H X ,H V 

A 

ff ~ € nf h. 

ii 

V 

Si 

B 



C 

, „ _ _ Ax 

-- eg ~ € Ax+6 



4 Numerical results 

In order to present the abilities of the approach presented in the previous sections, the new algorithms were 
implemented in the form of a computer code and used for computing of the cutoff frequencies of an inhomogeneous 
fine with a single conductive wedge shown in fig. 4. The following table compares the results for E x and Hi modes 
for the algorithm without and with correction for the conductive wedge presented in sec. 3.1, for the case a = 0 
9o = 0 and the edge comer located in the center of Yee’s cell. 
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Figure 4: Tested structure (a) and the same structure placed in Yee’s grid: nodes E z , H x , H y (b) H z , E x , E y (c). 
ei = 4e 0 . Hi - Ho- 



Standard | 





160x144 

msm 

Boa 

Ei 

38.04 

1.06% 

37.85 

0.57% 

37.75 

0.30% 


37.64 

37.58 

-0.17% 

Hi 

14.01 

-2.80% 

14.22 

-1.39% 

Hi 

WEMM 

14.42 

14.39 

-0.16 


In the standard algorithm the error for the coarsest grid 20 x 18 is —2.80% for the H\. When the new algorithm 
is applied, the coarsest grid leads to an error of -0.16%, 

To show the behavior of the algorithm when the edge corner is not located in the center of the cell, a series 
of simulations was carried out for the same structure but the 20 x 18 grid translated in the x,y space by the 
normalized distance —0.5 < XtjAx < 0.5, —0.5 < yt/Ay < 0.5. The contour plots showing the relative error for 
the algorithm with and without correction are shown in figs. 5, 6. For the standard method the largest error is at 
the level of 2.5% ( E\ mode) or 8% (Hi) while the modified algorithm reduces this error to 0.4% and 0.6% in the 
worst case. 
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Figure 5: Contours of relative error [%] of the cutoff frequencies of E modes versus normalized wedge location 
within Yee’s cell for the finite difference algorithm without (a) and with (b) correction. 
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Figure 6: Contours of relative error [%] of the cutoff frequencies of H modes versus normalized wedge location 
within Yee’s cell for the finite difference algorithm without (a) and with (b) correction. 
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Abstract 

An FDTD-based code for treating 3D arbitrary conducting and material bodies is presented. The 
geometry is modeled using NURBS surfaces, one of the most powerful and flexible tools for the 
geometrical representation of objets. On the other hand, the use of the CFDTD technique avoids 
the use of a staircase to shape the body, what improves the solution. The CFDTD method has 
been slightly modified when it is applied to material bodies by considering the discontinuity of 
electric and magnetic fields crossing the medium-air interface. The method has been applied to 
obtain RCS values of both conducting and dielectric bodies modeled by NURBS surfaces. Good 
agreement with results from other methods is observed. 


Introduction 

The FDTD method has been successfully applied to a wide variety of electromagnetic 
problems. Originally, its main limitation was its application to Cartesian orthogonal grids that 
involves the use of a staircase model to represent the body. It is known that this kind of 
representation introduces spurious responses and therefore degrades the solution. There are 
several ways of avoiding the use of the staircase model. In this communication the solution 
presented in [1] has been chosen. In [1] the orthogonal grid is preserved in almost the whole 
computational domain except in a few cells, those intersecting the geometry. In such cells, the 
contours in which Ampere’s and Faraday’s laws are applied in order to update E and H are 
adjusted to the shape of the conducting body. This method is known as CFDTD (Conformal 
Finite Difference Time Domain). As is presented in reference [1], this method is limited to 
perfect electrical conductor (PEC) bodies. Next, in [2], the same authors extended the method to 
deal with dielectric media, obtaining the portion of the contour inside the dielectric body and 
defming an effective dielectric constant. 

The contribution of this communication is double. In one hand, the formulation 
developed in [1] is applied to the study of perfectly conducting bodies modeled by NURBS 
surfaces, and in the other hand an alternative approach to the method of [2] is provided in order 
to deal with material bodies. 

NURBS (Non Uniform Rational B-Splines) [3] surfaces are curved surfaces of an 
arbitrary degree. Because of their arbitrary degree, they shape complex bodies accurately with 
limited (few) information. NURBS are becoming one of the most universal tools for the 
geometrical representation of objects. The application of NURBS and other geometrical tools to 
solve electromagnetic problems has provided excellent results in many cases. In [4-6] several 
codes to analyze the RCS from complex targets using high and low frequency techniques are 
presented. 
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An FDTD-based program that analyzed arbitrary geometry modeled by NURBS surfaces has 
been developed. The scatter is embedded inside a regular computational space (orthogonal grid) 
which is only deformed for the field components located next to the geometry to accommodate 
the curvature of the surface. The computational space is ended with UPML [7] absorbing 
boundary conditions and the far-field computations are performed using NF-FF (Near Field -Far 
Field Formulation) as it is done in [8] 


Theory 

As stated before, conducting (PEC) and material bodies are considered. Depending on the 
nature of the body the equations which update the.E and H components will be whether 
those of [1] for PEC case or an improved version of [2] for material bodies. 

For the sake of completeness, the updated equations for the PEC case are presented, in 
particular, for a H field component close to the geometry. Thus, H z | f j ^ is updated from the 
following equation -for more details see reference [1], 


H z n+ { (*,M)= H z n -\(i,j,k) + 


2 ’ fj*Area(i,j,k) 

E x n (i,j + l,k)*l3 - E x n (i,j,k)*l\ + E y n (i,j,k)*l2 



Figure CFDTD on perfecly conducting bodies 


Because the geometry is defined by NURBS surfaces, geometrical rendering algorithms have 
been developed in order to obtain the intersecting points of the contour with the geometry. 

For the treatment of dielectric media, the approximation presented in [2] is improved when the 
discontinuity of electrical and magnetic fields crossing the medium-air interface is considered 
(assuming the scatterer surrounded by air). The process to update a particular component of 
magnetic field H (H z \jj^ ) following Faraday’s law is developed below. 
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Expressing Maxwell’s laws in integral form, fields E and B are related following 
Faraday’s law: 


~d j Bds 


s 

dt 


- i" 


( 2 ) 



Figure 2.- Parameters to consider when Hz\i,j,k & updated (material media) 


Let us suppose that H z \ij£ is close to the geometry (that is to say, the contour Cl that 
surrounds it, intersects the geometry). In that case, a new formulation has been developed to 
update that component in time. Otherwise, Yee's formulation [9] is used. 

In figure 2 are pi and Si are the permeability and dielectric constant of the material medium and 
S is the area bounded by contour Cl. Applying Ampere's law to the perpendicular contour Cl, 

the variation of the flux of B 2 is related to the values of E x and E y in Cj. Otherwise, the 
contour Ci is defined by segments C, = {ij, L 2l , L 22 , L 3 ,1 41 , L A2 } as is shown in figure2. 

Applying Faraday’s law and assuming thatE is constant in the segments which conform C l} the 
right term in (2) is developed as follows: 
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- d j&ds 

--- =L E x u, 

dt 111 


+J ^21 ^21 ^21 + ^22 ^22 U 22 + -^3 ^3 ^3 + ^41 ^41 ^41 + ^42 ^42 **42 


(3) 

where , Ej y Lj are respectively the unitary vector, the electrical field and the length 
of segment j -see figure 2- 

Notice that the fields E 1 , E 2l , E 3 y E 42 are defined on the middle points of the 
segments that define the contour surrounding the surface S (points represented with crosses in 
figure 2). These fields are updated at every time step using CFDTD equations and they are 

always known. In equation (3) E 22 y E A1 are still left to define (points represented by circles in 
the figure). 

In the segment I 22 , E 22 is determined from£ in 121, enforcing continuity if the 
tangential electric field across the material boundary. 

Using as a reference figure 3, the point P represents the point where segment I 2 i 
intersects the geometry. Using geometric algorithms, it is possible to obtain in P the unitary 

tangential and normal vector (called n y t ) and decompose E* is terms of those vectors: 




(4) 



From (4), E 2l is splitting into its normal and its tangential components with respect to point P. 
Enforcing the boundary conditions (continuity of tangential components and discontinuity of 
normal components) E 22 is expressed as: 
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f 



V 

-r M 

l e o ) J 


where e 0 is the permittivity of free space (because it is assumed the body surrounded by air). 
Similarly, E 4] is obtained from £ 42 . 


( 5 ) 



The above formulation has to do with the development of the right term in (2) - that is to say, 
the circulation of E along contour C r . Next, the left term will be considered - variation of flux 
of vector B through surface S (see figure 1) that we rewrite in order to clarity: 


-3 jBds 

s 


Considering that B=pH, assuming that H is constant in S and averaging the value of |i in S; the 
value of H . component, is updated as follows 


1 

n+— 


H v 2 = H 7 


where fi is defined as: 


\_ 

~2 _ 


At 


jidl 

Cl 


: = jjuds 


and At is the time step. 


( 8 ) 


(9) 


In a similar way, we can derive finite-difference expressions for the Hy and Hx field 
components and the 3 components of E. 


RESULTS 


Bistatic RCS values of a perfectly conducting sphere of radio 0.03819 with ka=2.3 is shown in 
figure 4. The result is compared with the MM formulation of [6]. Each quadrant of hemisphere 
is modeled using only one NURBS surface; therefore eight NURBS surfaces are needed to 
conform the geometry. The sphere is illuminated by a theta-polarized plane wave that is 
incident in the way indicated in the sketch of figure 4. 
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Figure 4.- RCS of a conducting sphere with ka-2.3 


The behavior of arbitrary media is tested with a dielectric sphere with e r =5. The incidence is the 
same that of the previous case. Bistatic RCS for E and H planes is presented. Once again, there 
is very good agreement between the present formulation and those of those of [8] 
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Figure 5.- Bistatic RCS of a dielectric sphere with s r ~5 


Conclusions 

A FDTD approach to the analysis of complex tree-dimensional arbitrary conducting and 
material bodies has been presented. The geometry of the body is modeled by NURBS surfaces, 
which is and advanced geometrical tool. Fcg-material bodies, the CFDTD formulation is 












improved by considering the discontinuity of field components crossing the medium-air 
interface. 
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Abstract - A hybrid, three-dimensional Finite Element/Finite Difference Time Domain (FE/FDTD) 
technique for solving complex electromagnetic problems is presented in this paper. The method 
combines the computational simplicity of the structured FDTD scheme with the versatility as well as 
flexibility of the Finite Element method, and enables us to accurately model curved geometries 
without resorting to staircasing. In the scheme presented herein, the interfacing between the FE and 
FDTD algorithm is accomplished without the use of any overlapping in the transition region between 
the corresponding meshes. Preliminary numerical results that illustrate the accuracy of the method are 
included in the paper. 


I. INTRODUCTION 

It is well known that the staircasing approach employed in the conventional Finite Difference 
Time Domain (FDTD) method can introduce significant errors in the field solution when modeling 
curved geometries. In contrast, the Finite Element method (FEM) method is well suited for handling 
complex geometries as it employs an unstructured mesh that conforms to these geometries. Moreover, 
the weak form based on the Faedo-Galerkin formulation provides a very natural way for handling the 
field and flux continuity conditions at the material interfaces, and yields accurate results that can be 
further improved by using higher-order interpolating functions [1]. However unlike the FDTD, the the 
FEM requires the solution of a matrix equation with a number of unknowns that can be fairly large, 
especially when the tetrahedral elements are used. Even when a skyline band or a non-zero value 
storage is employed in conjunction with a sparse matrix solver, the solution of the system matrix can 
become an excessive burden on the CPU time. This attribute of FEM is especially inconvenient in its 
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time domain application, where a matrix solution is needed at each iteration step [2-3] in time. In 
contrast, the FDTD requires no matrix inversion and, hence, is numerically efficient. 

In this paper, we present a hybrid numerical techniques that combines the above two methods 
in a manner that retains the advantages of both. In particular, the FE is used only in regions with 
curved surfaces and high degree of inhomogeneities; thus its computational burden is kept low. The 
balance of the computational domain is analyzed via the FDTD, fully exploiting its numerical 
efficiency. A similar technique has already been presented in recent literature for the 2-D case in [4], 
and extended to the 3-D case in [5]. However, the hybridization of the FDTD and FEM schemes in 
[4] and [5] requires the knowledge of the electric fields at two different locations in the overlapping 
region, where the two boundaries (exterior and secondary) are defined. Experience shows that the use 
of such an overlapping region can introduce some difficulties in the mesh generation procedure. A 
different approach has been followed in [6] that circumvents this problem, where the interlacing 
between the FDTD and FEM is realized by using an interpolation scheme in the interface region. 
However, the two regions still partially overlap in this case, even though they do not need to be 
interlaced as in [4] and [5]. Additionally, the analysis presented in [6] is restricted to the use of prism 
elements in the FEM procedure. In contrast to the above-mentioned works, the method presented 
herein, which is based upon the technique presented earlier in [7] and [8] for two-dimensional 
problems, does not employ an overlapping region, and enables one to fully exploit the capabilities of 
modem mesh generators utilizing general elements. Indeed, the hybridization of the FDTD and FEM 
schemes is realized by using the electric field only at a single location on the interface, thus 
circumventing the need for an overlapping region between the FDTD and FEM domains. The main 
contribution of this paper is to present the three-dimensional extension of the technique described in 
[7], and to illustrate its application via some numerical examples. 


n. FORMULATION 

We begin by dividing D, the computational domain, into two sub-domains and D/rjr, 
corresponding to the FDTD and FE regions, respectively, such that D = D fd u D^. Next, we mesh 
these two regions by using structured and tetrahedral meshes, respectively, with common nodes 
shared at the interface but with no overlapping (D/rp n T>fe ~ 0). The unstructured grid is confined 
to the vicinity of the irregular region of the domain, and this allows us to limit the number of 
unknowns in the FE region to a moderate size. 

The FDTD solution in D^d is conventional in nature and, hence, requires no further 
discussion. In D/r £ , an FE formulation with curl-conforming vector edge basis functions is employed 
in the time domain to discretize the second-order vector wave equation. 

The specific FE-TD formulation adopted herein relies on the discretization of the second-order 
vector wave equation with appropriate boundary conditions. The solution for the electric field is 
considered in a volume £2 [3]: 
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„ 1 „ - dE d 2 E 

Vx—VxE + cy— + e —— 
fi dt dt 2 


dJ_ 
dt 9 


( 1 ) 


where fi, e and a are the permeability, permittivity and conductivity of the medium, respectively. 
Next, the fields are expanded by using the edge elements and a Galerkin testing procedure is 
employed. This enables us to numerically express the above equation as: 


d 2 e de 1 
£ A—= + cr A-=+ — Be 
= dt 2 -dt \l=~ 



( 2 ) 


where Ay = -Wj dO. and By = jVxW; -VxWj dQ, mdW; are the edge vector basis functions 
n n 

used for expanding the field, though it is possible to employ other basis functions as well. Specifically, 
curl-conforming vector edge basis functions [2] have been utilized in the present work. Their 
expression in simplex coordinates reads: W% [ i - 1,2,3,4, where W t is the basis 

function pertaining to the z-th edge delimited by nodes j and k. Assuming j < k , the above functions 
constitute six vector basis functions that have zero divergence and constant curl. 

By resorting to a mixed central and backward difference scheme for the derivatives with 
respect to time, (2) can be transformed into a marching-on-in-time algorithm as follows: 


e n+l = M(n e” +P e" -1 +/*), 

where 


( 3 ) 


M = 

N as 


Lat 2 

~ 2e 
AT 2 ' 


P 



ie£l A+ l®iz>B, 

AT J- fl = 

(e,-l/2)ff ] A (20 2 -0,) d 

AT J- ix 


(4a) 

(4b) 

(4c) 


e denotes the unknowns vector at time step n 9 and /” is the vector expansion coefficients for the 
current density j . With a proper choice of the two stability parameters 0j and 0 2 , this scheme is 

unconditionally stable; in particular, for 0j >0 and 0 2 >1/8 the results are stable [3] regardless of 
the choice of the time step. It is especially crucial to employ an unconditionally-stable time domain 
formulation for the FEM algorithm in problems with very small features that lead to large variations 
in the element size across the problem domain. This enables us to employ a time step which is 
identical to that dictated by the Courant condition in the FDTD domain, and which is typically much 
larger than the corresponding one in the FE domain. An added advantage of this hybrid method is 
that, unlike the classical subgridding techniques in FDTD, the present approach requires no 
interpolation in the time domain. 


970 





m. TIME-MARCHING SCHEME 

Next, we turn to the time marching scheme that has been implemented in our algorithm. Let 
He'd 2 be a generic component of the magnetic field close to the interface between the FDTD and 
FE regions in D fd, at a generic time step t n , while Hbe 12 is the adjacent magnetic field located in 
the T)fe domain (see Fig. la). The electric field component E$ can be evaluated in accordance with 
the FDTD algorithm, provided that the location of Hse 12 coincides with that of the magnetic field 
in a fully-structured grid. We address this problem by first identifying the tetrahedral element in the 
FE region which contains the point location of the desired Hse 12 > and then evaluating it exactly 
from the knowledge of the electric fields in the element. It is important to point out that, in the most 
general case, the tetrahedral element is different for each component due to the staggered location of 
the magnetic field components. Indeed, this is crucial for the implementing the hybrid procedure in a 
correct manner. 

In the following, we illustrate the computation of the magnetic field from the knowledge of 
the corresponding electric field values at the edges. In particular, if we expand the electric field over 
the 4 edges of a tetrahedral element in the FEM domain, we can write 

( 5 ) 

where is a generic position located within the element h, the corresponding magnetic field is 
obtained by discretizing the relationship: 

H(r,t) = --jVxE(?,t)dt, (6) 

P t 

which leads to the recursive formula: 

i w ' 4 - ij, _ 

H % +1/2 V y W j AT = VxW’yAT +tf£- )/2 . (7) 

^B=I/=l ( I J 

In (7), the index n refers to the temporal discretization. We note that the edge elements have 
the property that V x Wj is constant, implying that the magnetic field is constant within a generic 
tetrahedron. 

Let us now go on to describe the time-marching scheme. We begin by updating all of the 
electric fields E” FD in the region Dfd in accordance with the FDTD algorithm, using the values of 
the magnetic fields H n '^ n in the region D fd together with the magnetic fields tf^ 1/2 at the 
interface. All of the magnetic fields H^ 11 in the region can next be evaluated from E n FD . The 
electric field E n g along the interface between E>fd and D fe is used as a boundary condition (initial 
value in time, boundary value in space) for the FE procedure. We note that, at this stage, we also 
need the electric field at the edge not coincident with the FDTD grid (see Fig. lb); this can be 




computed by using a simple linear interpolation between the values at the sides. Once these 
boundary conditions are fully specified, the electric field E n FE at the time step t n can be evaluated 
throughout the region D^. The procedure can now be repeated to continue the time iteration. 



FDTD < - 1 


It 

;; 

fi 

t 


H-field 


E-field 



a) 


b) 


Fig. 1 - Interfacing the FEM and the FDTD regions. 


IV. NUMERICAL RESULTS 

We will now present some numerical results to validate the hybrid technique. Specifically, we 
examine the problem of scattering from a PEC cube 60 cm on a side, illuminated by a dipole source 
located at a distance of 80 cm from one of its faces and oriented along the z-direction of a cartesian 
coordinate system (Fig.2a). The results obtained from an ‘TDTD-only” scheme are compared with 
those derived for the same problem by employing an FEM region within the computational domain. 
The FDTD computational domain comprises 50x50x50 cubic cells, 10 cm on a side, and uses second 
order Absorbing Boundary Conditions (ABC). For testing the hybrid technique, an FEM region 
\mx\mx\m containing the PEC cube has been embedded in the FDTD computational domain. The 
region between the cube and the FDTD domain has been meshed by using 3141 tetrahedral elements 
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with 5829 edges, which include 1800 boundary edges and 600 PEC edges. In both of these cases, the 
time step was AT = 1.8 - 1(H° sec, and the FE stability parameters were chosen to be ©! = 0.2 and 
© 2 == 0.3. The dipole source radiates a gaussian pulse with a -3dB frequency equal to 150 MHz. In 
Fig. 2b, we plot the y-component of the electric field observed at a distance of 80 cm below the cube, 
computed with the two methods alluded to above. Excellent agreement between the two results serves 
to adequately validate the accuracy of the hybrid scheme presented in this paper. Before closing we 
mention that satisfactory results also have been obtained for different observation points and for all of 
the components of the electric and magnetic fields. Additional tests performed on problems with 
different geometrical and electrical parameters have further confirmed the accuracy of the procedure. 


V. CONCLUSIONS 

A three-dimensional hybrid technique for combining the Finite Element and Finite Difference 
methods in time domain has been presented. The hybridization between the two algorithms has been 
achieved without the use of any overlapping or transition region. This not only enables us to take full 
advantage of modem mesh generators, but also renders the technique flexible as well as accurate for 
solving complex electromagnetic problems. 
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Fig. 2 - a) Geometry of the problem; b) Ey component at the observation point computed by using 
a fully FDTD scheme (continuous line) and the present hybrid technique (dashed line). 
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Abstract - Evaluation of the electromagnetic field radiated by base station antennas in complex 
surroundings requires the development of hybrid methods joining local and asymptotic techniques. In 
the proposed method, geometrical optics rays are traced from a suitably defined quasi-constant-phase 
surface located in the near field region of a vertical antenna where the usual far-field approximation 
fails. Such a near-field wave front is determined by processing near field data, which are obtained by 
a conventional analysis of the field radiated by a stand-alone antenna. Results of a preliminary 
application of the method to a planar array of elementary dipoles are reported showing the 
practicability of the method. 


L Introduction 

Base station antennas for mobile communications are normally installed in complex 
environments including walls, buildings, pylons and humans. Environmental effects on 
electromagnetic propagation must be taken into account to comply with coverage requirements and 
exposition restrictions of people and things. To this purpose an accurate and efficient numerical 
prediction of electromagnetic near field is desirable. Ordinary local algorithms, based on Method 
of Moments, Finite-Difference Time-Domain or Finite Elements, although very accurate in the 
detailed modelling of stand-alone antennas, fail in modelling electrically large environments. On 
the contrary, high frequency methods, based on ray tracing including edge diffraction, are fast and 
robust in modelling electromagnetic propagation and interactions with scatterers, but they are 
accurate only if the scatterer is large in terms of wavelength and does not exhibit small features. 
Recently hybrid procedures were presented which employ local methods for the antenna and 
asymptotic methods for the environment. In [1] the FDTD method was applied to model a 
bounded region including dielectric bodies while the field interaction in the external region with 
large conducting bodies was modelled by a generalised ray expansion. The interface between the 
ray region and the FDTD region was defined by means of equivalent surface sources. In [2] a 
moving frame procedure was proposed, where an asymptotic technique was used for wave packet 
propagation while interactions of each wave packet with environmental scatterers were FDTD 
modelled. In [3] a hybrid procedure takes advantage from ray tracing to build a modified Green 
function accounting for the antenna complex environment. The antenna radiation pattern is then 
obtained solving a Boundary Element or Finite Element problem. In the last method, changing the 
antenna position with respect to the environment requires a complete new simulation to be 
restarted. 

In this paper we present a preliminary investigation on a hybrid procedure for direct 
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application of ray tracing methods to the antenna near field. The method works as follows. First a 
“near field wave front” is obtained close to the antenna by retrieving a quasi-constant-phase (QCP) 
surface. Then the wave front is partitioned according to a polyhedral mesh and astigmatic ray 
beams are launched and traced in the environment [4]. The retrieval of a QCP surface for the 
antenna near field is a non-linear problem, which can be computationally expensive for real 
antennas. Therefore the antenna stand-alone near-field, that is obtained once and for all by means 
of a numerical simulation with a local method (or even by measurements), is sampled and 
efficiently interpolated on a surface enclosing the antenna. Since the antennas of radio-base 
stations have generally vertical structure, a vertical circular cylinder is adopted as the enclosing 
surface, from which the field is propagated as a superposition of cylindrical waves for the 
computation of QCP surfaces. (Fig-1). Under the hypothesis that the antenna field is not modified 
by the environment, which is acceptable for the computation of exposition volumes, ray-tracing 
only has to be repeated for any antenna location with respect to the environment. The procedure 
to obtain the near-field wave front will be presented in the next sections, while the integration with 
an efficient ray-tracer will be the subject of future work. 



Fig. 1: Schematic view of the hybrid procedure: A wave front Sqcp is retrieved from near field 
data close to the antenna. Ray tracing is performed outside Sqcp- 


n. Method 

A vertical antenna, with the main size in the z-direction, is placed at the origin of a 
cylindrical co-ordinate system (p,<j>j). Consider the electromagnetic field radiated by the stand¬ 
alone antenna and an unbounded cylinder So having radius Ro> axis z and enclosing the antenna. 

The field can be obtained at a discrete set of points belonging to So by means of a 

numerical procedure (or by measurements). 

The field outside So, can be expressed as superposition of elementary inhomogeneous 
cylindrical waves [5]: 


Vm = * Jmi> H m ] (K-P)e >P 


( 1 ) 
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with the Hankel function of second kind and order m, K = -fi$ -P 2 , Im[K]<0, k§ -2nlX 

(X is wavelength), p is longitudinal wave number. The resulting representation splits into TM and 
TE waves. TM equations are: 


= ~/'P 2 a, 


TM &Vm 

dp 


H = Y 0 ZxE 


m=-oo 
TrZ _ r 2 


e 2 =(*o -p 2 ) 


( 2 ) 


where T 0 * coe/p . Dual equations can be derived for TE waves with E~ZqH_x2 , Zq = cop/p. 
For a given radiation field, the coefficients a™ and a™ are functions of p. Coefficients a™ and 
a™ can be retrieved from the field evaluations at )}m • fact we have for 

E Zi i =E 2 (RoAi^i)' 

+00 _ 

E ZJ -2 (T J a ™ (A 2 - p 2 )H™ (*o #o ~ P 2 (3) 

I tt —00 

while a similar equation holds for H 2i = H x {R^ 9 ^ 9 z t ) from which coefficients a™ are 
determined. 

A wave front is normally defined as a constant-phase surface in the antenna far field, where 
all the field components have the same phase function, moreover £, H. and the normal, n , to the 
wave front are mutually orthogonal. Nevertheless it is possible to define a quasi-constant-phase 
surface in the antenna near field if the above conditions are not strictly enforced. This surface can 
be computed according to the following iterative procedure: 

1) Starting from a radial distance i?j > Rq , a constant-phase surface Sq^ p for the dominant field 
component ( H# for vertical, z-directed antennas) is carried out via a search procedure on z- 
slices. Hf is computed by equations (2). S^p is obtained as stacked constant-phase lines. 

2) The Poynting vector & R + jSi is computed at points of S^p . Then two tests are performed: 

(i) |£/|/|S*| <£i; (ii) cos _1 (^-5^/|5/?|)<£2, where and s 2 are positive small numbers. 
These tests are repeated for a sufficiently dense grid of points on Sqq ? . 

3) If the tests are passed then Sqq, is considered as a quasi wave front surface. Otherwise, the 

initial distance is increased: -» R\ + Ap and the procedure restarts. 

A particular field interpolation on the cylinder S 0 proved to be useful to solve for a™ by 
equation (3). Though the electromagnetic field exists all over S 0 , nevertheless it is expected to be 
substantially localised within a cylindrical waist of S 0 centred on z=0. Therefore each 
electromagnetic-field component, e.g. E z , is expanded on a Hermite polynomial basis [6,7,8] 
accounting for z- dependence as follows: 
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( 4 ) 


£,(* 0 ,*z)= £ £A mn h„(z)e■-** 
m=-oo n~ 0 


with: 


Ki z ~)= e 2 


H n {cz) 
(vrc2 n n\) m 


(5) 


H n (cz )is the scaled Hermite polynomial of order n, and c is a scaling factor. The set \h n } 
constitutes a complete orthonormal basis, defined on the whole real axis z ; which provides a 
compact support, around z-0, controlled by the scaling factor. A further property of functions h n 
(also called associate Hermite functions) is used for solving equation (3), i.e., the Fourier 
transform of h n also is an associate Hermite function. The set {A^} constitutes the spectrum of 
H+ with respect to the chosen basis. The summations in equation (4) are truncated to M and N, 
respectively, for numerical computation. 


m. Numerical Example 

To test the method we considered a curtain array of 8 x 2 elements, with XIA horizontal 
spacing and XI2 vertical spacing. Infinitesimal electrical dipoles of unitary moment 71 oriented 
along the z-axis (Fig. 2a) were used. 



Fig a) curtain array of infinitesimal electrical dipoles; 


b) \E Z \ on cylinder p — Rq ~ A,, for It = 1 . 


This geometry is quite similar to that of standard antennas for mobile-communication base- 
stations, with the electrical dipoles replaced by half-wavelength wire radiators. Since the overall 
length of the reference antenna is D- 3.5A, the conventional far-field transition distance from the 

antenna is 27) 2 /X = 24.5 X. This antenna radiates a TM field since H z is zero everywhere. The 
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spectrum A mn was obtained from closed-form computations oiE z at points of a cylinder S 0 with 
radius Rq = X (Fig. 2b). Only a few spectral terms were necessary to obtain an accurate field 
representation for p>R$. As an example, the reconstruction of by M=6 and N=9 terms is 
shown in Fig. 3 for comparison with the exact results. 



Fig. 3: Real and imaginary parts of vs. zJX on the line p=3k, <fr=0 for the array in Fig. 2: 
Exact solution (dashed line) and numerical solution (continuos line). 


An example of QCP surface, computed by the iterative procedure with initial distance =3A,, is 
shown in Fig. 4a. This surface exhibits a nearly cylindrical cross-section in the central region, 
where the Poynting vector takes larger values. Surface normal and Poynting vector real-part 
direction are superimposed at points of the QCP surface in Fig. 4b. A substantial parallelism can 
be appreciated from the diagram, except for larger values of z, which, however, provide less 
contribution to the overall radiation. 



Fig. 4: a) Quasi-constant-phase surface for the array in Fig. 2. Poynting vector amplitude also 
shown, b) Cut 4>=0 showing surface normal (continuous line) and Poynting vector real part 
direction (dashed) 
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IV. Conclusion 

The radiation-field computation problem for a vertical antenna in the presence of scatterers 
in the antenna near field has been considered. The results of a preliminary investigation on a 
hybrid procedure have been presented, that is based on a ray-tracing interface with field data for 
stand-alone antennas. Ray tracing is started from a near-field wave front close to the antenna. 
Therefore the problem addressed by this paper was that of finding a quasi-constant-phase surface 
in a region close to the antenna, where the usual conditions for the validity of ray optics are only 
partially met. Initially the field is computed on a vertical cylinder enclosing the antenna by a 
suitable method (e.g., method-of-moments) to account for antenna geometrical and physical 
characteristics. The propagation outside the cylinder is modelled as the sum of cylindrical waves, 
whose coefficients are retrieved by enforcing the continuity to the stand-alone antenna field at a 
set of points of the cylinder. The formulation takes advantage from field data interpolation on a 
basis of Hermite polynomials. A preliminary application to a simple curtain array has shown that 
ray tracing can be applied from a 3 A. distance onwards. 
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Abstract —A hybrid approach that relies on the decomposition of the simulation geom¬ 
etry into a number of uniformly meshed subregions is proposed to model complex three- 
dimensional structures. The 3-D boundary integral technique is used for homogeneous 
bounded and unbounded regions. Each inhomogeneous subregion is either described at one 
frequency by a finite eliment method combined with a frontal elemination algorithm, or an 
interaction matrix is calculated at multiple frequencies using FDTD. The interaction ma¬ 
trices and the excitation vectors resulting from the three different techniques are assembled 
into one global matrix equation by expressing continuity of the tangential fields at all the 
interfaces between the subregions. 


L Introduction 

With the drastic increase of the computer power, conventional numerical techniques such 
as the finite element (FE) method, the finite-difference time-domain technique (FDTD) and 
the boundary integral equation (BIE) approach have proven their use in versatile modeling 
of 3-D EM configurations. However, the applicability of the FE and FDTD method mostly 
lies in describing problems with an inhomogenous and bounded character, whereas the BIE 
technique is very efficient in handling unbounded structures composed of a limited number of 
homogeneous subdomains. This dearly suggests that new, more powerful formalisms can be 
found by decomposing the simulation geometry into different subregions and by describing 
each of them with the most efficient numerical approach for that domain. Up to now, the 
most popular of these hybrid methods has been the combination of finite elements with a 
boundary integral equation in the frequency domain. 

Quite recently, we have developed a new formalism in which both the FE method and 
FDTD algorithm are combined with a BIE approach [1,2]. The FE method or the FDTD 
technique then takes into account the inhomogeneous parts of the configuration whereas 
the BIE approach is used to rigorously model the open character of the structure and the 
interactions between the different subregions. In previous publications, much attention was 
devoted to the coupling of the FDTD formalism with a BIE formulation in the frequency- 
domain [3-5]. The technique allows to obtain a system matrix for a given inhomogeneous 
subregion within a certain frequency band, by performing a number of FDTD simulations. 
In this contribution, it will be shown how the FE formulation can also be efficiently incor¬ 
porated into the hybrid formalism, by means of a frontal elimination technique. In [1,2], 
this technique was applied to study the radiation of lossless structures. In this paper, lossy 
configurations and plane wave incidence are also considered. 
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II. Geometry decomposition 



Fig. 1. Sample 3-D geometry with partitioning into subregions. 


Consider a 3-D configuration consisting of scatterers and sources, schematically repre¬ 
sented in Figure 1. Let us now assume that the simulation geometry can be decomposed 
into a number of subregions. The field equivalence theorem then allows a separate de¬ 
scription of each subregion by introducing additional virtual sources J virt = n x H and 
M V i r t = E x n on the interfaces between the different subregions. These latter sources act 
as the problem’s unknowns. 

Such a partitioning scheme can considerably enhance the efficiency of the EM problem 
description. First of all, each subregion can be described with a different modelling strategy, 
specifically adapted to the particular EM features of that subdomain in the following way. 
For inhomogeneous and bounded subregions, one can chose between two numerical tech¬ 
niques: either one uses the FE method for the type la subregions or the FDTD algorithm 
for the type lb subregions. For homogeneous bounded type II and unbounded type III sub- 
regions, a BIE formulation is applied. Current sources (schematically represented by the 
source regions J/, J// and J/// in Figure 1) can be present in all the bounded subregions, 
and a plane wave can be incident from within the type III subregion. Second, the description 
of a particular subregion can be reused in the presence of several identical subregions within 
the configuration. This is typically the case when modelling frequency selective surfaces [3j. 


III. Hybrid FE-FDTD-BIE formulation 

A. BIE formulation 

Homogeneous bounded or unbounded regions are described by means of a BIE technique. 
Consider a type II subregion V m , with virtual interface S m between this subregion and the 
rest of the simulation geometry. We use the 3-D Poggio and Miller [6] integral expressions 
to find the electric and magnetic fields at the interface S m due to the virtual sources on S m : 

E = -ifirj fJ G3dV ~JJ [VG x (Exn) + V-(Hxi)^ + j> r G(H x a)}dS (1) 

H = —J f J^VG x J dV - J [VG x (H x n) - V • (E x n)^ - je r G(E x fi)] dS (2) 
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with G (r | r') = e * . Similar expressions hold for the type III subregion in Figure 

1, involving virtual sources on interfaces Si, Sj and S m and with addition of an incident 
plane wave source contribution E ln and H in . Note that by using both tangential electric and 
magnetic field components in the formulation, the problem of interior resonances is avoided. 

Since we will solve the global problem using a method of moments technique, it will 
be necessary to discretize the virtual boundaries 5», Sj and S m into a uniform mesh of 
rectangular cells and to represent the unknown virtual sources by rooftop basis functions 
w r : 

N N 

E(r) = ££ r w r (r), H(r) = £.ff r w r (r), r£SiUS,-US m (3) 

r=l r^l 

To determine the set of unknown expansion coefficients (£ r ,i7 r ), these same rooftop func¬ 
tions wall be used to apply Galerkin weighting to the boundary integral expressions. For 
each type II and III subregion, the discretization process results in a matrix expression of 
the following form: 



It will now be shown how similar matrix expressions can be found for the inhomogenous 
subregions, either with an FE method or an FDTD formalism. 


B. FE technique 

Let us now focus on the inhomogeneous Type la subregion Vj with boundary Si. The 
following Galerkin FE formulation is used to describe subregion Vf: 


/ Hv V * w ”)'^ l '( VxE ) rfV - fc o/ + » JJJ™ 


■ J dV 


= +JW//0 


It,; 


• (n x H) dS 


(5) 


Ij yV x w r ) -t; 1 ■ (V x H )dV-*%fjf™r ■ P r ■ H dV- jJJ{ 


= - jwe o 


ft 


w H ■ (n x E) dS 


(Vxw r )-!7‘.J dV 


( 6 ) 


The formulation is discretized by covering the volume of a type la subregion with a uniform 
mesh of bricks and chosing expansion and weighting functions w r to be the 3-D versions 
of the rooftop basis functions applied in (3). Discretisations of the FE subregion and the 
adjacent BIE subregions are chosen conform, meaning that the boundary discretisation and 
the field expansion for the BIE formulation is found as the restriction of the FE mesh to 
the boundary of the subregion. 

During the construction of the FE system matrix, a frontal technique is used to condense 
the sparse matrix to a dense matrix containing only the coefficients within the expansions 
(3) of the fields on the boundary 5* of the Type la subregion as unknowns. By chosing 
the grids uniform and conform to each other, the frontal elimination can be done in a very 
efficient and robust way. For each type la subregion, the process leads to following matrix 
expression, similar to (4): 

©-(£)♦*=••© 


C. Constructing a subregion’s interaction matrix using FDTD 


In [4], it is shown how a matrix expression, similar to (4) and (7), can be constructed for 
each type lb inhomogeneous subregion Vi with FDTD. 

■ <jf 


©- 




+ Zp: 


© 


( 8 ) 


To find an EM description of a type I subregion V), the interaction matrix Z FDT D must be 
constructed , which relates the electric and magnetic fields on the boundary Sj to the virtual 
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currents J v i r t and on 5*. We therefore define a mesh of rectangular BIE cells on its 

boundary Si and expand the unknown fields on Si into rooftop basis functions following 
(3). Since the subregion is described by performing FDTD simulations, its volume Vi must 
also be subdivided into a mesh of Yee cells. We choose the FDTD and BIE discretizations 
in such a way that one BIE cell comprises an integer number of Yee cell facets, as shown in 
Fig. 1. 

The subregion Vi is then excited by each basis function, implemented as an electric or 
magnetic current source within a separate FDTD simulation. The resulting electric and 
magnetic fields on 5* yield one column of the FDTD interaction matrix. By performing a 
number of FDTD simulations equal to the number of basis functions, the complete inter¬ 
action matrix is formed. To take into account the excitations J/ within a type la domain, 
an additional FDTD simulation is required [5]. The geometry of Vi remains the same as for 
constructing Z FDTD , but now J/ acts as source and no virtual currents are present. 

Because of the large number of simulations to obtain a system matrix, special measures 
are taken to perform fast FDTD calculations by reducing transients within the FDTD 
simulation domain. This is done by invoking field equivalence, since fields are zero outside 
the FDTD subdomain, because of the presence of the virtual currents J V i rt and on 

S{. To avoid inefficient FDTD simulations due to long lasting transients, the FDTD grid 
is terminated in the region outside Vi by an absorbing boundary condition or by absorbing 
Berenger material. Obviously, the introduction of absorption does not affect the accuracy 
of the final results, because fields are zero outside V{. 

As demonstrated in [4], the fields at the boundary due to the excitation of one basis 
function can be derived at multiple frequencies in one FDTD run, by chosing a suitable time 
dependence for the current sources (e.g. sum of sines or Gaussian pulse). After exciting all 
the rooftop basis functions, an FDTD interaction matrix is stored on disk for the different 
frequencies under consideration. Since geometrically identical subregions have identical 
interaction matrices, such a matrix can be reused each time a subregion with that specific 
geometry occurs. 

D. Assembling the global geometry’s interaction matrix 

A global description of the geometry is found by imposing the continuity of the tangential 
fields at Si/j in a weak sense through Galerkin weighting: 

lim f f w r -(nxE BIE (r))dS = lim f [ w r ■ (n x E FE / FDTD (r ))dS (9) 

J 1j j J JSi/j 

r Urn JJ w r • (n X H BIE (r)) dS = f Km JJ w r • (ft x h fe / fdtd (r)) dS (10) 

for all basis functions w r and at each boundary S^j. 

This yields a matrix equation of the form (^fdtD **" ^FE “ ^BIE) ■ X — B with X 
containing the unknowns E r and H r and B a source vector due to the current sources and/or 
incident fields. In Fig. 2, a schematic overview is presented that summarizes the proposed 
hybrid scheme. 


IV. Numerical example 

Let us consider the configuration drawn in Fig. 3. The structure consists of a microstrip 
substrate, consisting of a dielectric with e r = 12 and dimensions 9mm x 9mm x 550/im, and 
a cupper ground plane with a = 5.8 10 7 and thickness 50/im. A y-oriented Hertzian dipole, 
radiating at 10GHz, is placed at the air-substrate interface, in the middel of the substrate. 
Above this configuration, a frequency selective surface (FSS) is defined, consisting of a 4 by 
4 array of dielectric patches with permittivity € r = 12. Each of the patches is tilted over 
45°. 

For modelling purposes, following decomposition of the configuration is proposed: The 
substrate together with the dipole are placed within a FE subregion (resolution 11 x 11 x 13 
and cell dimensions 1mm x 1mm x 50//m), whereas each plate of the FSS is embedded 
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Fig. 2. Overview of the FE-FDTD-BIE Technique. 


within a separate FDTD subregion. For one such region an FDTD mesh is defined with cell 
dimensions 0.5mm x 0.5mm x 0.5mm together with a surface mesh on the virtual boundary 
(resolution 6x6x2 and cell dimensions 2.5mm x 2.5mm). 

Using the technique described in section III-C, an FDTD interaction matrix is generated 
within a frequency range from 6 GHz to 15 GHz. The resulting interaction matrix can be 
reused for all cells of the FSS. The interaction matrix is then retrieved from disk for / = 8 
GHz and / = 10 GHz and coupled with the FE and BIE descriptions. In Fig. 4 and Fig. 5. 
the radiation patterns in the xz~ and yz -plane are shown at resp. 8 GHz and 10 GHz, and 
in Fig. 6 and Fig. 7, the bistatic RCS is presented for a incident plane wave E m = e*i kz \i y 
at resp. 8 GHz and 10GHz. A good agreement is seen between the hybrid FE-FDTD-BIE 
technique and FE-BIE approach, with the patches placed in FE subregions. 
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Abstract 

Microwave and high-speed circuits are reaching ever-increasing speeds and physical complexity. 
As a result the need for accurate simulation of both lumped circuit elements and complex geometries has 
also increased. The finite-difference time-domain (FDTD) method is capable of modeling arbitraiy 
materials and physical geometries, but the implementation of frequency-dependent lumped circuit loads is 
not trivial. This paper presents a new method for representing multiport devices with arbitraiy frequency 
responses in FDTD simulations using approximate recursive digital filters. 

Introduction 

The FDTD method has been used in the past to model lumped circuit elements using the extended 
FDTD method [1]. This method utilizes an additional lumped circuit current density in Ampere’s Law to 
include the device behavior into the FDTD update equations. This method has recently been extended [2] 
to model arbitrary two-port linear networks by applying digital filtering techniques to this lumped circuit 
current density. This paper presents an approach based on the equivalent source method [3] for including 
lumped devices which can model networks with an arbitrary number of ports [4]. The equivalent source 
method relies on derived voltages and currents in the FDTD grid rather than directly modifying the Yee 
update equations. Through the use of approximate digital filters, the convolution usually required to 
include dispersive materials and devices in FDTD is implemented in a straight-forward manner. 

The equivalent source method allows for transmission line voltages and currents to be calculated at 
all device ports at every time step. Manipulation of these total voltages and currents provides the incident 
voltage wave at each port, and a recursive digital filter is used to calculate reflection and transmission from 
each port to all other ports. 

The time-stepping scheme used in the FDTD method is by nature a discrete-time system, so the 
techniques used in digital signal processing can be applied directly without modification. The sampling rate 
in FDTD is much higher than that used in normal DSP applications due to numerical stability constraints. 
At the highest resolved FDTD frequency such that the grid spacing Ax is equal to X/20, the FDTD sampling 
rate is at least 20 samples per cycle, well above the Nyquist limit of 2 samples per cycle. This leads to a 
very low ratio of the digital filter pole and zero frequencies to the sampling frequency, which can lead to 
inaccuracy and instability in digital filters [5]. 

Potential applications of this method include system-level electromagnetic simulations of large 
systems with many subsystems. These subsystems could be simulated individually, and then combined to 
determine overall performance, particularly in systems with transmission line interconnects between lumped 
circuit devices. 


Formulation 

The total voltage and current on a transmission line is given by the combination of forward 
(towards a port) and reverse (away from a port) travelling waves. The convention for defining these total 
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parameters is that the forward (V a ) and reverse {V b ) voltages add, and reverse current (4) subtracts from the 
forward current (4). The forward and reverse voltages are related to the forward and reverse currents by the 
characteristic impedance of the line, Z D . 

V = V a +V b l = l.-l b 


Combining the total voltage and current with knowledge of the line impedance, the forward travelling 
voltage can be extracted from the total instantaneous voltage. 

( 2 ) 

Using the reflection and transmission coefficients at a port, the reverse voltage that would be observed due 
to the attached load can be calculated using the extracted forward voltage. The total voltage at any port is 
thus given by the superposition of the forward voltage at that port and the reverse voltages due to all other 
ports. Using the notation of S-parameters, which are generalized reflection transmission coefficients, the 
reverse voltage V% at the/th port due to the forward voltage at port / is given by: 

y 

S =— (3) 

iJ V, V a =0 w 

at a k*i 

where Sy is the reflection/transmission coefficient relating to V bj . These reflection and transmission 
coefficients have been used in the FDTD algorithm [6] to implement multiport devices with constant 
coefficients. Examples of such devices are idealized gain stages and isolators. FDTD implementation in 
this work uses a microstrip line of known impedance with a gap where the device is located. Total voltage 
and current at the open microstrip ends are calculated from the magnetic and electric fields in the FDTD 
grid. Equation (2) is applied to get the incident voltage at each port, and then equation (3) is used to derive 
the reverse travelling voltages. The new total voltage ( V a +V b ) is then enforced in the FDTD grid using 
equivalent voltage sources at the microstrip ends, and then the simulation is advanced one more time step 
before repeating the process. 

This method can be extended to frequency-dependent loads by considering discrete convolution at 
each port In this case the discretized reverse voltage V b [k] at a port is given by the convolution of V a [k] 
with the reflection impulse response h[k] at the port. 

V b [k]=j^V c [n\{k-n] (4) 

where k and n are indices into the discrete variables V b , V a , and h. While this is an adequate method for 
calculating the reverse voltage, a recursive formulation is desirable. Fortunately, a recursive form of the 
discrete convolution is readily available in the form of and infinite impulse response (HR), or recursive 
digital filter. This type of filter is described in the Z-domain by equation (5), 

v b (z) = ^\ v Jz) (5) 

B(z) 


where 

A(z) = £a k z- k B{z) = \-f^b t z~ k ( 6 ) 

k =0 A=1 

L is the number of filter poles, and M is the number of filter zeroes. The operator z' 1 is a delay of one 
sample in the time domain, so converting equation (5) to the discrete time domain and cross-multiplying 
yields: 

1 ]- b M V b [k-M]= a«V a [k] + a x V a [k-l] + - + a L V Q [k-M] (7) 

which can be rearranged to solve for V b [k] by moving all other terms to the right side of equation (7). This 
produces a formulation for the most current value of V b in terms of the previous values of V b as well as the 
current and previous values of V a . 
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( 8 ) 


M L 

m=l /=0 

The generalized reflection and transmission coefficients in equation (3) can be put in the form of equation 
(4) to provide recursive discrete convolution at all ports. This involves approximating the Z-domain 
response at each of the ports as the ratio of two polynomial known as the Pade approximation [7]. This 
approximation is accomplished using the discrete impulse response h[k] at each port, and transforming it 
into the Z-domain. The port discrete impulse response can be thought of as a summation of time-shifted 
impulse functions, so the Z-transform can be found by inspection. 

H{z)^c k z- k (9) 

A=0 k =0 

H(z) is the Z-transform of h[k /, T is the time between samples, and c k is the amplitude of the Ath sample. 
Using the Pade approximation, H(z) can be written as [7,8]: 

-+o(z- (i+Mtl> ) (io) 

l + 2>-' 

/=1 

where 0(z' {l ' +M+1) ) is an error term in the L+M+l sample due to the approximation. The filter coefficients a t 
and b, can be found by multiplying out the denominator in equation (10) and matching coefficients of like 
powers of z 1 [7], If the negative of the b, coefficients in equation (10) are used, this form is identical to 
that in equation (5) and thus an approximate digital filter for the port impulse response has been found. 
Equation (7) can be implemented using the a t and b t coefficients found from the Pade approximation, 
provided that the port impulse response is sampled at the same rate as the FDTD time step. 

Multiplying the denominator of equation (10) over to the left side of the equation and equating 
coefficients of equal powers of z x yields a system of equations which can be stated as a matrix equation [7]: 



where the Ck’s are coefficients of the system impulse response, and the bfs are the coefficients of the 
polynomial in the denominator of equation (10). This can be solved by a number of matrix solution 
methods. The numerator coefficients, a ; in equation (10), can then be found by: 

a 0 a \ — **• — c L + ( 12 ) 

/=i 

where L is the order of the numerator polynomial in equation (10), and M is the order of the denominator 
polynomial. As can be seen from these equations, only the first (L+M) terms of the discrete impulse 
response c[k] are used, which can lead to very large numbers of poles and zeros in the approximate filter for 
long impulse responses. 

An alternative method for determining the approximate filter coefficients is a least-squares fit using 
the time-domain input u[k] and output y[k] of the system [9]. The input-output relationship is fit to a 
recursive model in the form of equation (8). 

,y[£] = b l y[k-\] + ''' + b M y[k-M] + aQu[k'\ + ''- + a [ u[k-L\ (13) 

A vector 0 of filter coefficients and another <|> of input and output signal parameters can be defined: 

0 = <P = \y{k-\}-y{k-M] u[k]-u[k-L]J (14) 
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where the approximate reconstructed output signal y[k|G]=4) T 0. Taking the sum-squared error of (y[k]- 
y[k|0]) for a sequence of length N and then differentiating with respect to 0 to find a minimum of the error 
yields a matrix formulation to solve for 0. 


*=i 

i>[*-iM*-2] • 
*=1 

*=1 

V 


|>[*-1M*] 

k=l 

f>[*-2M*-l] 

*=1 

i>[*-2] 2 . 

**i 

• ^>[*-2M k-L] 

*=1 

K 

a o 

= 

f>[*-2M*] 

f j u[k-L}y[k-l] 
_*=1 

i>[*-i] 2 •• 

*=i 

t,u[k-L] 2 

k~ I 

, a l. 

t 

f>[*-ZM*] 


This method has the feature of using all the time samples of u[k] and y[k], and ‘degrades gracefully’ when a 
suboptimal number of poles and zeros are used in the approximate filter. Both of these features are 
advantageous over the Pade approach presented in equations (10-11). Figure 1 illustrates the effect of using 
the Pade approximation and a least-squares solution when attempting to reproduce the pulse response of a 
simple HR filter, for which there is an exact solution in the form of equation (8). The filter being 
approximated has: 

0.5 0.1 0.1] =[0.1 0.1 0.1 0.1] (16) 

for the filter coefficients. When the number of poles (M) and zeros (L) used in the Pade and least-squares 
solutions are equal to the number of poles and zeros in the original HR filter, both techniques arrive at the 
exact solution, shown in figure la. When the solution space for these two methods is restricted to include 
only three poles, the exact filter response lies outside the solution space. The Pade approximation shows 
marked degradation in performance, while the least-squares solution still closely follows the original pulse 
response, shown in figure lb. For the restricted solution space the two methods yield: 

0.5007 0.0996 -0.1172] a Pade =[0.5 0.0783 -0.3174] 

^lsq =[0.1009 0.5364 0.0188] b Pad! =[0.1435 0.9217 -0.1609] > ' 1 ’’ 

The results presented in this paper use the Pade approximation to arrive at the filter recursion coefficients. 
Future work will utilize least-squares approximations. 



Figure 1— Pade (circles) and Least-Squares (stars) approximations to an HR filter pulse response 
when (a) the exact solution is included in the solution space and (b) when the solution space is 
restricted to not include the exact filter response 
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Simple Digital Filters 

In order to validate this technique, several canonical digital filters were implemented in FDTD. The 
a and b coefficients for these filters can be found from analytic expressions, which eliminated the Pade 
approximation as a source of error. Section IV presents a digital filter approximation for a three section 
quarter-wave impedance matching circuit, which does utilize the Pade approximation to find the filter 
coefficients. 

The FDTD model used to validate these digital filters was a 1mm wide microstrip transmission line 
on a 0.25mm thick air substrate with a 10mm gap. The model boundaries were terminated with a PML 
absorbing boundary condition. The microstrip line was excited by a Gaussian pulse modulated at 20GHz 
with a bandwidth of 40GHz. The filter output voltage was measured on the other side of the gap from the 
source, and then normalized by the spectrum of the exciting function to find the filter response. The phase 
is corrected for the time delay between the source and the gap, and between the gap and the measurement 
location. The time step used for this model yields a sampling frequency j /j = 2.3983THz. 

The first recursive filter validated was the cascade connection of four single-pole lowpass filters 
with the same cutoff frequency. The coefficients for this filter can be found by multiplying four single-pole 
lowpass transfer functions together, which produces the following expressions for the aggregate filter 
coefficients: 

*o=( l -*) 4 h x =Ax 

b 2 = -6x 2 b 3 — 4x 3 b A - -x A (18) 

- 14 . 445 — 

x = e ** 

the cutoff frequency f c for this filter is 8GHz. The ideal IIR filter response and the FDTD implementation 
are shown together in figure la, with good agreement. This principle of combining filter transfer functions 
can be applied to any cascade or parallel combination of HR filters. 



Figure 1—Magnitude and Phase of a (a)4-Pole Lowpass HR Filter and (b) a Bandpass HR Filter 
shown with the FDTD Implementation 


The second simple filter used in this validation was a bandpass filter. This type of filter is defined 
by the center frequency f c and bandwidth f B w of the passband. The recursion coefficients for this filter are 
given by [5]: 

a 0 =\-K a x =2(K-R)coJ(2n f / f ) 

a 2 =R 2 -K b x =2i?cos(27T%) b 2 =-R 2 ^ 
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where R and K are defined in equation (20). 

R=\-3Iez. K=- 

fs 

f c was chosen to be 10GHz, and the 3dB bandwidth fsw to be 2GHz. Figure 2b shows the ideal HR filter 
response with the FDTD implementation, with good agreement. 

Other filters used to validate this method include single-pole low pass, single-zero high pass, 
Chebychev, and notch filters. All of these filter types showed good agreement between the ideal HR filter 
response and the FDTD implementation. 

Impedance Matching Network 

As an application of this method to practical microwave circuit design, the frequency response of a 
microstrip three-section quarter-wave impedance transformer was included in FDTD as a recursive digital 
filter. 

Si i and S 21 for a maximally flat three-section transformer [10] between 50Q and 77D at 10GHz 
were found using Hewlett-Packard’s Microwave Design System (MDS) software package. Using the 
calculated frequency-domain Sn and transmission S 21 characteristics, the reflection and transmission 
impulse responses were found by inverse Fast Fourier Transform (FFT) [11], and fit to the FDTD time step. 
The Pade approximation in equation (10) was applied to these discrete impulse responses to produce 
approximate recursive digital filters that could be implemented in FDTD using equation (8). 

The FDTD implementation depends only on the value of the reflection and transmission 
coefficients and not on the actual impedance of the connecting lines, so the same FDTD model as in section 
HI was used for simplicity. The modulated Gaussian pulse used in the validation studies was excited at port 
1, and the time-domain reflection and transmission were measured. These signals were transformed to the 
frequency domain and normalized by the spectrum of the source waveform. The magnitude and phase of 
the desired response, the HR filter found by Pade approximation, and the FDTD implementation are shown 
together in figure 3a for Sn and figure 3b for S 21 - 


2Rcoi27r f / f )+R 2 

2-2cos(27r%) 



Figure 2—Magnitude and Phase of (a)Sn and (b)Sji of a Three-Section Quarter Wave Transformer 
found using MDS (dashed), Pade Approximation of the MDS results (dotted), and FDTD 
implementation (solid) 


Figures 3 a and 3b show that while the Pade approximation does not reproduce the full depth of the 
Sn null at 10GHz, the overall response shape for magnitude and phase is acceptably reproduced for both Sn 
and S 2 i. The Pade approximation introduces some error because the method for finding a t and bi utilizes 
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only the first L+M samples of the impulse response, losing information at the end of the impulse response. 
The multiple reflections taking place within the three-section transformer yield a very long impulse 
response, and even a large number of poles and zeros (L+A£=1000) does not capture enough information to 
reproduce the full depth of the 10GHz null. Continuing work is focusing on methods, such as least-squares 
approximation and frequency domain techniques, for finding an optimum set of poles and zeros using all 
the information available in the impulse response. Another source of error, exhibited by the HR filter and 
FDTD responses deviating at high frequencies is due to parasitic reactance at the microstrip open-end in the 
FDTD model. Future models will include a method for calibrating out the minor effect of these parasitics. 

Conclusions 

A new method for including arbitrary frequency-dependent linear networks with an arbitrary 
number of ports in FDTD using digital filtering techniques has been introduced. This technique relies on 
the equivalent source method for describing lumped circuit elements to implement the interface between the 
field interactions in FDTD and the voltage and currents used to describe the port characteristics of a device. 
A number of simple digital filters were implemented in FDTD to validate the use of HR filters, and an 
example involving a three-section quarter-wave impedance transformer was shown. 

Sources of error in this method include the direct application of the Pade approximation, which 
does not use all available information in the impulse response, and parasitics in the microstrip gap used in 
the FDTD model. Future work will calibrate out these parasitics, and will use a more sophisticated method 
for identifying the approximate IIR filter poles and zeros. 
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Abstract: The paper presents a hybrid technique, which combines the Method of Moments (MoM) 
with the Finite Difference Time Domain (FDTD) method to analyze electrically-short, thin-wire 
antennas located in the vicinity of inhomogeneous dielectric bodies. The antennas are dealt with in the 
frequency domain using the NEC code, while the FDTD is used to analyze the inhomogeneous part of 
the problem. The two sub-problems are linked together and hybridized by invoking the equivalence 
theorem . 


L INTRODUCTION 

The analysis of thin-wire antennas located in the vicinity of inhomogeneous dielectric scatterers 
is important in numerous electromagnetic applications, e.g ., wireless communication, electromagnetic 
compatibility and ground radar. A hybrid method that combines two numerical techniques, viz., the 
Method of Moments in the time domain (MoMTD) and the Finite Difference Time Domain (FDTD) 
scheme, was recently introduced in [1] to investigate this type of problem. The above approach takes 
advantage of the versatility of the FDTD for handling complex structures with arbitrary material 
properties, and the ability of the MoM to deal with thin wires; and yet, it overcomes some of the 
drawbacks [2] of using the FDTD alone to handle the same problem. The above time-domain hybrid 
method is quite general in its application, and is useful for studying the problems of transient 
excitation of an arbitrary thin-wire configuration in the presence of inhomogeneous objects that may 
also be arbitrary. However, it is not as numerically efficient as one might desire, at least for the case of 
harmonic excitation of wire antennas located in the vicinity of an inhomogeneous medium [3,4], and 
the method proposed in this paper has been designed to provide improved performance for such 
problems. The proposed scheme is a new hybrid method that combines the well-known NEC code [5] 
for frequency domain analysis with the FDTD technique. For the specific case of harmonically-excited 
short antennas, the new scheme uses fewer computational resources than the time-domain hybrid 
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method [1], and yet yields excellent results. The present method [6] is formulated in a self-consistent 
way, which does not require the use of iterative procedures to model the coupling between the 
different sub-regions. 


II. DESCRIPTION OF THE METHOD 

In this section we describe the proposed hybrid approach for combining the FDTD technique 
and the NEC code by considering the problem of a thin wire located in the vicinity of a dielectric body 
(see Fig. 1). The two numerical schemes are used to analyze specific parts of the problem for which 
they are best suited individually, viz ., the integral-equation technique for the wire and the FDTD for 
the dielectric body. The first step is to divide the original problem into the following two sub¬ 
problems: (a) thin wire antenna located in an unbounded free space; and, (b) the inhomogeneous object 
inside surface S\ with the thin-wire antenna removed. The solutions of the two sub-problems are 
interconnected through the surface equivalence theorem (Huygen’s principle). The process for linking 
the frequency domain variables used in NEC, to the time domain ones needed in the FDTD, is 
described below. 



Fig. 1: Thin wire antenna in the vicinity of a 
dielectric scatterer and the FDTD computational 
domain 


First, the Electric Field Integral equation (EFIE) for the current I(s\ induced on the surface of 
the thin-wire antenna by an incident field E ‘, is formulated by enforcing the boundary condition on 
the tangential electric field along the wire axis. Including the contribution of the field scattered by the 
inhomogeneous body, E B , within the total electric field at position s on the wire, the EFEE can be 
written as 

s • \E (s) + E s (. s ) + E B (s)] = 0 (1) 

where s is the distance along the wire axis, s is the unit vector tangential to the wire axis at position s, 
and E s is the electric field that would be scattered at the same position if it were in free space. The 
field £ 5 in(l)isan unknown, with a linear dependence on /(s), for which we do not have a closed- 
form expression in general; however, it can be computed numerically. 
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The point matching form of the method of moments is applied to (1) by dividing the wire into 
Ns straight segments and expanding the unknown function I(s) into the sum of the Ns basis functions, 
(/ = l,-^i) used in NEC [5]. For the antenna case, this transforms (1) into the matrix equation: 




As, 5 


i = N s 


( 2 ) 


where ty are the coefficients of the current expansion; V(Si) is the applied voltage at position s, on the 
wire ; At is the length of segment i; Zu are the elements of the NEC-impedance matrix [Z] when the 
wire is located in free space; and, Zy are the contributions arising in the presence of the dielectric 
body. They represent the tangential electric fields at positions s, on the wire, scattered by the 
inhomogeneous region by theyth basis function fj(s), and are computed numerically. 

To obtain the row j of the matrix [2 s ], we assume that the current is given by jj{s), and then 
compute the tangential electric field on the sampled points along the wire. To this end, we begin by 
introducing a Huygens' surface, S, around the thin-wire antenna (see Fig. 1), where the time-domain 
values of the equivalent sources are deduced from the magnitude and phase of field radiated by the 
antenna in free space, that were computed previously by using NEC. Next, the FDTD algorithm is 
applied, in the presence of these equivalent sources [7] but in the absence of the antenna, to the entire 
computational domain consisting of the total field (outside S) and the scattered field (inside 5) zones. 
The tangential electric fields are calculated at the specific locations Si (i-1, ...Ns) on the antenna after 
the steady state condition has been reached inside S, and interpolation is used if the points are not 
coincident with the FDTD grid points. Finally, we obtain the elements Z?ij (i-1, ...Ns) we were 
seeking, after transforming the time domain data into the frequency domain. The procedure is repeated 
for j-\,...fls, until all the rows of the matrix [Z B ] have been filled. 

Subsequently, a modified version of NEC is run to solve (2) and to obtain the current I(s) 
induced on the antenna, from which all of the fields in the FDTD computational domain can be 
derived. We can obtain these fields either by repeating the procedure described in the above paragraph 
for the actual current I(s), or by applying linearity if the observation variables have been kept in 
memory when solving for each of the$*$')’s. 

The entire procedure entails a running of the FDTD code Ns times and, hence, the method is 
only useful for antennas that can be modeled accurately by using a small number of basis functions. It 
should be pointed out, however, that the computational burden in running the program Ns times is still 
smaller than that in the FDTD scheme employing a very small cell size, which may be required to 
accurately model the thin wire antenna. 

HI. NUMERICAL RESULTS 


To validate the technique described in the last section, the hybrid method has been employed to 
solve several simple problems for which independent solutions can be obtained with NEC alone is 
possible. We now present the results obtained for a V antenna and a circular loop operating in a two- 
media environment. 
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3.1 V antenna 


We have investigated a V-antenna located in the proximity of an interface between free space, 
and either a homogeneous half space with a relative dielectric permittivity e r =4, or a perfectly 
conducting (PEC) plane. The inner angle of the antenna is cc=12(f, and the length of each of its aims 
is 6.6 cm, its radius is 0.8 mm, and the distance to the interface between the two media is 0.3 m. The 
antenna was excited by applying a 900 MHz voltage source, of strength 0.5 V, at each of the two 
central segments. It was modeled with Ns=6 segments, after determining that the results for free space 
simulations did not vary significantly when Ns was increased beyond 6. The FDTD solution space 
contained 80x80x80 cells, and a size of 1.5 cm. Figure 2 shows the magnitude and phase of the x- 
component of the electric field sampled at a set of observation points plotted with a dashed line in 
Fig.2b. The points are located at a distance of 7.5 cm away from the interface, with the first one at 
D-0.3 m from the antenna The solid lines plot the NEC results and the (■) symbol lines designate the 
ones obtained by applying the hybrid method proposed herein. It is evident that the results obtained 
with the hybrid scheme are in very good agreement with those derived via the MoM approach. 



0.0 0.2 0.4 0.6 

Distance (m) 

(a) 


0.0 0.2 0.4 0.6 

Distance (m) 

(b) 


Fig. 2: V antenna: Magnitude and phase ofE x sampled at a set of observation points 
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3.2 Circular loop antenna 


For the second example, we analyze the problem of a circular loop antenna located parallel to 
the same interface, between free space and a homogeneous half space with as was the case in the 
previous example. The distance from the loop to the interface is 0.3 m, the radius of the loop is 1.8 cm 
and the radius of the wire is 0.8 mm. The antenna was divided into 9 segments, and one of these was 
excited with a 900 MHz voltage source. Figure 3 shows the current at the feed point obtained by using 
NEC alone, as well as with the hybrid method described here. Figures 4 and 5 show the ^-component 
of the electric field and the x-component of the magnetic field at the same set of observation points as 
used for the plots in Fig. 2b. Specifically, they are located parallel to the interface and at a distance of 
7.5 cm from it. Again the agreement between the two methods is seen to be excellent. 




(a) 


(b) 


Fig. 3: Circular loop antenna: Current at the feed point, a) Magnitude, b) Phase 




(a) 0) 
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Fig. 4. Circular loop antenna: E z ata set of points. (a) Magnitude, (b) Phase. 




Fig. 5: Circular loop antenna: H x at a set of points, (a) Magnitude, (b) Phase. 
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Abstract — A review of some hybrid combination of asymptotic high frequency and numerical 
methods is presented for solving a variety of electromagnetic (EM) radiation/scattering configurations 
which cannot otherwise be analyzed in a tractable fashion by either of the methods when they are used 
alone. The numerical (or low frequency) methods are best suited to deal with electrically small 
portions of the radiation/scattering object because they generally become highly inefficient and even 
intractable for treating electrically large portions, whereas the high frequency methods are ideally 
suited for dealing with the electrically large portions but cannot deal with electrically small parts. 
Specifically, examples of hybrid solutions will be described for analyzing the coupling of edge and 
surface diffraction effects on a large, smooth convex boundary containing edges when illuminated by 
a distant wave, the scattering from a large inlet cavity containing a complex interior termination, the 
radiation from a cavity backed slot antenna on a large aircraft, and the radiation/scattering from a large 
planar phased array containing small antenna elements, respectively. 


L INTRODUCTION 

This paper presents a review of some hybrid asymptotic high frequency and numerical methods for 
analyzing a variety of electrically large EM radiation/scattering problems. Such hybrid methods 
become necessary when the radiating/scattering objects become electrically large, and also when they 
contain some electrically small portions as well. It is noted that the use of purely numerical methods is 
primarily restricted to low frequency regimes (or electrically small problems), because the number of 
unknowns to be solved increases dramatically with frequency. In contrast, asymptotic high frequency 
methods are best suited for analyzing electrically large problems. However, most large practical 
structures contain both, electrically large as well as small parts, in general. Clearly, therefore, no 
single technique, i.e. either low or high frequency technique, can be used to adequately analyze 
electrically large configurations containing electrically small features if the latter are significant 
contributors to the overall radiation/scattering effects. 

Some of the earliest works in the development of hybrid combinations of high frequency and 
numerical methods are found in [1,2], where a combination of ray methods and moment method 
solutions of governing integral equations are described. Subsequently, several other authors have 
introduced different hybrid combinations of high and low frequency techniques, as for example in [3] 
which combines physical optics and moment method solution of governing integral equations and also 
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introduces iterative techniques within the hybrid framework to further improve the accuracy. The 
present work reviews hybrid solutions that demonstrate their ability to solve relatively complex and 
fairly realistic radiation/scattering configurations. In particular, four different radiating/scattering 
configurations are selected and their hybrid solutions are briefly described in the following section. 


H. BRIEF DESCRIPTION OF SOME HYBRID SOLUTIONS 

In this section, the hybrid solutions to the following four radiating/scattering configurations are briefly 
reviewed in items (A) through (D). 


(A) Scattering by a Half-Moon Configuration 

Consider a perfectly-conducting cylindrical scatterer with a semi-circular or half-moon cross-sectional 
shape, which is excited by an EM plane wave. This two-dimensional (2-D) problem is assumed to 
have an electrically large half-moon cross-section. Such a canonical geometry is selected to study the 
coupling between the edge and surface diffraction effects; it is actually ideally suited for analysis via 
the high frequency, ray-based uniform geometrical theory of diffraction (UTD) [4]. However, the 
UTD requires a knowledge of the relevant edge diffraction coefficients that also simultaneously 
provide expressions for the launching of surface rays on the curved portion of the cylinder. These 
UTD coefficients can generally be found from the analytical solutions to appropriate canonical 
problems [4]. While the UTD coefficients for the diffraction by an edge have been known for the 
geometry in question for rays diffracted directly into external space by an arbitrary curved wedge [4], 
the edge excited surface ray launching coefficients have been found analytically only recently for 
describing the coupling of edge and smooth convex surface diffraction effects. Clearly, if UTD 
expressions are not currently available to describe some diffraction phenomena, then the method of 
analysis similar to that used for the half-moon geometry can be employed to efficiently obtain such 
UTD diffraction or launching coefficients, etc., in a numerical fashion via the hybrid approach 
described here. Such numerical UTD coefficients can be obtained if the functional forms of the fields 
are known analytically away from the diffraction/launching point; the unknown initial amplitude of 
the diffracted/launched field which constitutes the relevant UTD diffraction/launching coefficient for a 
discontinuity is then obtained numerically via the hybrid scheme. The solution discussed here for the 
scattering by a half-moon essentially provides numerical values for the UTD launching coefficients of 
the edge excited surface rays on the convex portion of the half-cylinder. Since the analytical UTD 
solutions for this case have only been found recently, its accuracy can be checked by comparison with 
the numerical hybrid solution being discussed here. The hybrid solution to the half-moon (semi¬ 
circular cylinder) begins by formulating an integral equation for the unknown currents induced on this 
configuration by an externally incident plane wave. This unknown current is expanded into a set of 
mixed basis functions; a very small subsectional basis set is utilized only at and very near the two 
edges of this half-moon, whereas a very small entire domain set is used elsewhere on the flat and 
curved portions. The entire domain basis functions contain the well known UTD Fock-type functions 
on the curved side and the ordinarily and slope-type UTD edge transition functions on the flat side, 
respectively, corresponding to two oppositely-travelling ray paths on this geometry, as described by 
the present authors in [5], The subsectional basis set can be pulse (or other suitable) functions. The 
unknown amplitudes of these handful of total basis sets can then be found numerically, via the 
moment method (MoM) solution which utilizes UTD functional forms for representing the unknown 
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current away from the discontinuities, thereby drastically reducing the number of unknown which 
needs to be solved via the MoM. The conventional MoM solution would require dramatically more 
unknowns since it does not utilize a priori knowledge of the current behavior away from the edges. 
Thus, the present hybrid approach utilizes the physics of the problem to achieve not only efficiency 
but insights into the scattering mechanisms. Numerical results for this case show that the edge excited 
surface ray phenomena is significant for certain aspects of incidence and scattering even for a very 
large half-moon as shown by the present authors in [5]. 


(B) Radiation/Scattering by a Planar Finite Array Configuration 

Consider a planar, electrically large rectangular array of short, thin dipole antennas in air. Such a 
configuration is selected for simplicity in illustrating the hybrid analysis of large finite arrays. The 
theory of periodic structures cannot be applied directly to solve a finite periodically-spaced phased 
array. Also, conventional numerical solutions based on element by element field superposition become 
highly inefficient and even intractable for large arrays. Following the ideas discussed in problem (A) 
above, one can again formulate an integral equation for the array element currents due to an external 
plane wave illumination (for the scattering problem) or due to a known voltage excitation at the feed 
location of each dipole element of the array (for the antenna problem). The unknown current shape on 
each dipole element is represented by a set of standard basis functions for thin wire dipoles, and 
requires only a pair of piecewise sinusoidal basis set for a dipole whose length is less than or equal to 
half a wavelength. The unknown amplitudes of these dipole currents defined at the feed points must 
then be solved via the MoM procedure. The number of unknowns is generally greater than or equal to 
the number of antenna elements of the array in the conventional approach; thus, the number of 
unknowns can become exorbitantly high for large arrays. On the other hand, if one can incorporate the 
recently-developed high frequency UTD ray-based functional forms for describing the behavior of the 
unknowns for elements which are not at or extremely close to the edge (boundary) of the array, based 
on the work in [6,7], then one can achieve a drastic reduction of the unknowns as demonstrated by the 
present authors in [7,8]. The unknowns at and near the edge can also be related along the edges via the 
impressed phase variation which in known [7,8] leading to a further reduction in unknowns. The only 
unknowns which remain are then primarily concentrated in very small regions near the array comers. 
Such a hybrid UTD-numerical procedure is found to be very efficient, and it also lends physical 
insight because of the UTD functional forms utilized in this approach. Numerical results based on this 
hybrid approach for arrays are shown to be quite accurate when compared to results obtained via the 
conventional or direct numerical schemes. 


(C) Scattering by Inlet Cavities 

It is not an easy task to determine the scattering from electrically large inlet cavities, especially when 
their interior termination is complex, as might be the case when modeling the scattering by a jet inlet 
cavity. However, it is seen here that a hybrid approach for predicting the scattering from inlet cavities 
provides a useful tool that leads to a relatively tractable solution. This problem is important because 
inlet scattering can contribute significantly to the overall RCS of modem jet aircraft. The hybrid 
approach described here is based on breaking the problem up into its component parts, then treating 
each part separately in an efficient way, and finally combining the different solutions appropriately. 
Here, the finite difference time domain (FDTD) method is used to determine the reflection matrix 
associated with the termination in terms of the circular waveguide modes since the jet inlet region near 
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the engine termination is generally circular. The FDTD method is employed since it is more efficient 
than the frequency domain methods in obtaining a broadband frequency information. On the other 
hand, the long front portion of the relatively arbitrarily shaped inlet is analyzed via high frequency 
iterative physical optics (IPO), or the generalized ray expansion (GRE) and its modifications to 
include elliptic ray tube basis functions. The fields within the inlet are excited by an externally 
incident plane wave, and the fields are tracked from the open front end to the circular cross-section 
termination region via IPO, or via GRE. The IPO or GRE-based fields are then expanded into a 
circular waveguide modal sum; these are then coupled to the termination region FDTD solution which 
is converted to the frequency domain and also expanded in a set of circular waveguide modes. The 
coupling is carried out via mode matching. This hybrid procedure has been developed by the present 
authors in [9]. Numerical results based on this hybrid IPO/GRE-FDTD analysis are shown to give 
very reasonable accuracy when compared to reference solutions for simple circular inlet shapes. The 
RCS of relatively complex inlet shapes have also been obtained. 


(D) Radiation from a Cavity Backed Slot Antenna on Aircraft 

In this problem, a cavity backed slot antenna with a probe feed inside the cavity is flush mounted on 
the fuselage of a relatively realistic aircraft model. The cavity backed antenna is first placed on an 
infinite planar perfectly conducting surface, and the fields in the slot are obtained by solving an 
integral equation for the equivalent magnetic currents corresponding to the slot fields. Since the cavity 
is rectangular in shape, the interior Green's function is expanded into a set of interior rectangular 
cavity modes, whereas, the external fields are obtained via the free space Green's function in the 
integral equation formulation. The integral equation is solved via MoM. Once the equivalent sources 
are found via the conventional MoM, they are then assumed to be unchanged to a first order of 
approximation when placed on an electrically large smooth aircraft fuselage. These sources are then 
allowed to radiate in the presence of the aircraft, and the radiated fields are found using the high 
frequency UTD method via the NEW AIR code developed at The Ohio State University 
ElectroScience Laboratory. This hybrid procedure which has been developed by the present authors 
[10] is different form the previous ones discussed above, but at the same time is in some manner 
distantly related to that described in problem (C) above. Numerical results based on this hybrid 
approach have been obtained for sufficiently realistic aircraft models such as for the F-16 fighter 
configuration. 


in. CONCLUSION 

It is shown that the hybrid approaches which systematically combine asymptotic high frequency and 
numerical methods can be employed successfully to solve a variety of electrically large EM radiation 
and scattering problems which cannot otherwise be solved in an efficient or even tractable manner by 
any one method alone. 
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1 Introduction 

The electromagnetic scattering from the interior of a complex jet engine inlet contributes sig¬ 
nificantly to the overall radar cross section (RCS) of a modern jet aircraft. The scattering 
mechanisms in jet or missile inlets are complicated and difficult to simulate accurately. The ge¬ 
ometry is both complicated (engine face, structural obstacles, materials) and electrically large. 
Several authors have developped hybrid numerical methods in the frequency domain to solve 
this problem ([2, 3]). 

Recently, an hybid method combining Integral Equations, Partial Differential Equations and 
Modal expansion has been developped at ONERA ([1]). In this paper we present several results 
obtained with this method called FACTOPO applied to the RCS modulation of Antennas and 
Rotating Fans. 

The global target fI is split into Afy subdomains (V5)t=idV v separated by Nj fictitious sur¬ 
faces (rjbssi^ATj (Figure 1). For each subdomain (even the exterior three-dimensional volume), 
the generalized Scattering matrix Si is computed with different methods such as the three- 
dimensional finite-element method (FEM) or the electric field integral equation (EFIE) com¬ 
bined with a generalized waveguide modal representation of the fields on the fictitious surfaces 
([4]). We use waveguide modes as expansion functions on the fictitious surfaces to model the 
coupling of sub-domains.Then, the different objects are connected by cascading the different 
S matrices. In a context of parametric investigations, the scattering matrices of the modified 
domains have to be re-evaluated, the other ones are simply re-used in the connection step. This 
strategy reduces significantly the computation time compared to traditional hybrid methods for 
which a geometric or electromagnetic modification requires a. new computation of the complete 
target. 

We will emphasize the strong reduction in computational ressources obtained with this coupling 
scheme and especially the waveguide modal expansion. The advantages of this formulation for 
parametric studies will be illustrated by two cases: computing the RCS of an air-intake termi¬ 
nated by a flat PEC or a rotating fan, and of an antenna structure coupled to an electronic 
feed with a varying parameter. 
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2 Formulation 


The target fl is split into Ny subdomains (K‘)*=i,atv by Nj fictitious surfaces (Fj^-i^ (Figure 
1). Note that Vo is an unbounded volume, and the volumes {Vi,.., V/v v } are bounded. In each 
bounded or unbounded volume VJ of the decompozition, the boundary dV{ is made of n{ fictitious 
surfaces Fj and a part of Fo- The surrounding medium has permittivity £o and permeability p 0 . 
The scatterer is illuminated by an incident wave {E sourcei H sourC e} with wavenumber k — ujc. 



Figure 1: Target decompozition. 


2.1 Modal functions on the fictitious surfaces 

On each fictitious surface Tj of VJ, the tangential fields (n x E, n x H) are expanded in a set 
of waveguide mode functions (n x ££, n x H*). These modes are computed on planar surfaces 
of any shape with a finite element method ([4]). If pj denotes the number of modes on Tj, the 
total tangential fields are written as: 

f (n x E) rj = Ep.,, P ,K, + H,p)( n x El) 

1 (» X H) T, = Ep=i, Pj « P - K P )(n X Hi) K) 

We shall now introduce these field decompozitions in the EFIE formulations. By linearity of 
the Maxwell equations, the tangent magnetic field on each of the n : - fictitious surfaces F 7 of the 
volume Vi is a linear combination of the following fields: 


(n X H)dVi — ^2 ( a i,p bi,p){ n * $ excitation) dVi + X Hsource )dVi (2) 

j=l,n, p-l,pj 

EFIE operators P,Q,B,S ([1]) can be used in this formulation to compute the H J e £ citation fields 
for both bounded and unbounded volumes: 


(B-S)(4) = (P + Q)(Ki) (3) 

with Ji = nx K ] p = Ej, x n. 

The scatterer is illuminated by an incident wave {E source , H source } located in the exterior un¬ 
bounded domain Vo. The J SOU rce = n X H $ovrce field is computed with the integral equation 

(B - S){J s ource) = 2 P(E SOU ras X n) ( 4 ) 
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2.2 Scattering matrices 

We shall now characterize the volume V by its scattering matrix Si. On all of the n* fictitious 
surfaces Tj, the tangential components of the total magnetic fields are written: 

^ ( a lp “ X ^p) = { n x $$ourct,Bv}jTj 

+( E E Kp + KM n * aS-wtaM/Ti 

j=l,ni p=l,pj 

(5) 

We define the hermitian scalar product of the vectors u, v : {u,v) r . = u.v*ds. By expressing 
the h* coefficients associated with the outgoing waves in terms of the aj coefficients associated 
with the incoming waves on the n, fictitious surfaces, we obtain the following system: 



The scattering matrix of V} is defined by the Si^Uf 1 .Vi matrix of the right hand side of equa¬ 
tion (6). The vector h SO ur C e=^o" 1 *^o corresponds to the outgoing waves generated by the H source 
fields in Vo • 

After the scattering matrices Si,..,5jVv of the cavity domains have been computed, the interior 
domain is assembled with a standard connection scheme ([5]) yielding the scattering matrix 
Si nt . Finally, the exterior domain Vo and the interior V/ n * are assembled by identification of 
the outgoing waves on the fictitious surface Vi. We get the following linear system: 

[Id - SoS/n«] {&0 = (7) 

Then, once the vector {6j} has been determined, the total electromagnetic field is recombined 
on 8Vq for all the incident waves. 

3 Modelization of the fan rotations 

The blade rotations in the engine subdomain is a particular case of parametric investigations. 
The challenge is obtain the RCS for all the angular positions of the engine from the computation 
of a reference position. In a general manner, a scattering matrix is associated to a linear appli¬ 
cation S and a base (/ p ) and express the outgoing waves in function of the incomining waves 
on each interface of the subdomain. The algorithm consists in writting the scattering matrix 5 
in a new base ( g p ). If P is the matrix transforming base ( f p ) in base (g p ), the associated matrix 
S in the (g p ) base is derived by : S= P~ l .S.P . The modelization of the engine rotation is then 
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realized with a simple rotation of the basis functions (f p ). In this particular case, the fields are 
expanded on modal circular functions and the matrix P(<p) for a rotation angle tp is analytical 
as well as its inverse In addition P(<p) and F -1 (<^) ^ ave no more than two non zero 

coefficients on each row. The engine scattering matrix S((p) for a blade position making an 
angle <p with a reference position <^=0 is then obtained with only two matrix products 5(<^)= 
P~ l {(p).S(#).P(<p). If N is the total number of edges unknowns of the engine subdomain, NB 
the number of modal functions on the interface, then we need 1&.K.NB 3 operations to compute 
the scattering matrices associated with K different positions of the engines blades. The same 
investigation with a full analysis for the K blades positions will required K * (f Af 3 + 8. N 2 .NB) 
operations. 


4 Test-cases 


4.1 Air intake geometry : ” CHANNEL” 

The geometry of this case is an evolutive channel enclosed in a circular cylinder. The entry 
of the channel has an elliptic cross-section, the end of the channel has a circular cross-section. 
The object is perfectly conducting. The geometry is presented in figure 2. 

The wave vector of the incident wave for <^=0 is directed along -X, and the electric field is 
parallel to Y ((/></> polarization) or to Z ( 66 polarization). 


X 


z 



Y 



Figure 2: Geometry of the problem (all dimensions are in meters). 


The location of (y s , z s ) the center of the waveguide cross-section is a function of x: 

V. = -0.04878 + 0.116sin 2 (^ ; z, = 0 

The elliptic cross-section is also a function of x : 


For 0 < x < 1.300 


(E 


Z—Za 

11 SQ3-f0.02568 X f^ 0 3 0 ° - Q 


) +(«3 


05877-0.03358- 


“-^r) 2 = l 


For 1.300 < x < 1.360 


V0.11803/ “ V0.05877^ x 
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The far field RCS is computed at 3 GHz for the two polarizations and 66. We have intro¬ 
duced a spherical incident wave emitted at 8.73 m from the point (0.592,0,0) so as to reproduce 
the measurement conditions . The target is split in two volumes Vo (39,627 unknowns) and Vj nt 
(33,316 unknowns) each one computed with the EFIE. The tangent fields on Pi are expanded 
with 13 propagating modes. The figure 3 show the RCS versus the angle of incidence in the <j><f> 
and 96 polarizations. The results are in agreement with the measurements. 



4.2 Antenna geometry: ”DENEB” 

The geometry of this case is a rectangular horn enclosed in a circular cylinder. The horn is fed by 
an electronic network. The object is perfectly conducting. The geometry is defined by figure 4 . 

The wave vector of the incident wave is directed along -Z, and the electric field is parallel to 
X ( cj>(f> polarization) or to Y ( 66 polarization). The target is split into three volumes : Vo the 
exterior domain, Vi the rectangular horn and \\ the rectangular waveguide fed by the electronic 
network. 



Figure 4: Comparison of measured and computed Modulated RCS for DENEB, 
Incidence zero degree, 66 polarization , F=4.o Ghz. 








The frequency of interest is 4.5 Ghz. The spherical incident wave is generated at 8.71 m from 
the point (0,0,0.251). The mesh of Vo leads to 32,904 surface unknowns and the scattering 
matrix So is computed with the EFIE formulation. The interior horn domain Vi has also been 
computed with the EFIE solver (7,161 surface unknowns). The fictitious surface Ti between 
Vo and Vi is located at 0.03333 m from the aperture. The tangent fields on T i are expanded 
with 30 propagating modes. The vertical fundamental mode (electric field parallel to Y) is 
used on the second fictitious surface localized in the rectangular waveguide V 2 at 0.03333 
meter from the junction with the rectangular horn. The scattering matrix 5 2 of V 2 is measured 
for 23 states of the electronic network. These parameters are defined on the reference plane 
localized at the junction between the horn and the rectangular waveguide i.e at 0.0763 m from 
the termination of the cavity. 

Figure 4 shows the comparison between the computed and measured RCS versus the vary¬ 
ing parameters of the electronic alimentation for an incidence angle of zero degree in the 90 
polarization. The results are in agreement with the measurements. 

4.3 Engine modulation 

The following test (Figure 5) is the evolutive inlet CHANNEL ( interior domain Vj nt is the 
connection of the inlet subdomain Vi and the engine subdomain V 2 ) inserted in a cylinder 
(exterior domain Vo). The frequency of interest is 3 Ghz and the RCS are computed in the two 
polarizations from —10 to 45 degree. The fields are expanded on 17 propagating modes on the 
engine interface. The V 2 subdomain is a single rotating blade as described on Figure 5. The 3 
GHz monostatic RCS for two different blades position (p are computed by cascading the Si and 
§ 2 (<p) matrices and solving the linear system (7). One complete period of the RCS modulation 
pattern has been computed in a post-processing phase with the above algorithm for an azimuth 
angle of 30 degree using the reference scattering matrix ( 9 = 0 ) caracterizing the V2 subdomain 
(Figures 6 ). The agreement with a full analyzis using the EFIE solver is very good. 



Figure 5: Air-Intake terminated by a simple rotating structure 
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CHANNEL 3GHz , One Blade termination 
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Figure 6: Complete period of the RCS modulation 
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5 Conclusion 

In this paper, we have introduced a multi-domain and multi-method technique for analyzing 
the EM scattering from 3-D cavities. The Electric Field Integral Equation is used to compute 
the generalized Scattering matrices of the subdomains. We have used waveguide modes for 
modeling the field propagation from the cavity opening down to the obstacles or the engine face. 
RCS patterns of various complex cavities including modulation were presented and compared 
to measurements to show the accuracy of this method, the reduction in computation time and 
the advantage of this method for parametric studies. 
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Abstract 

A real-time interface between Transmission Line Matrix (TLM) and Haar-Wavelet based Multi-Resolution Time 
Domain (MRTD) algorithms is presented and described. Like other hybrid combinations of different methods, the 
present scheme may offer improvements in computational efficiency and permits embedding of special features and 
boundaries into MRTD. Since the Haar MRTD scheme is operated at a time step twice that of the TLM scheme 
for equal spatial sampling density, the connection algorithm does not require any interpolation or extrapolation 
either in time or in space. The proposed algorithm exhibits smooth field distribution without spurious distortions 
even around discontinuities or obstacles, which could not be achieved with conventional MRTD schemes. 

I. Introduction 

Application of wavelets in the time-domain numerical modeling of electromagnetic fields has 
been extensively studied [l], [2], [3] for the purpose of reducing computational expenditure of the 
analysis methods. The advantage of those wavelet based techniques resides in their highly linear 
numerical dispersion property and the possibility of using much coarser discretization compared 
to the conventional space discrete methods, thus leading to significant reduction of the number of 
unknowns in the analysis region. Even the simple Haar wavelets can improve the computational 
efficiency of the numerical analysis through the coupling of scaling and wavelet basis functions at 
excitation and boundary conditions [4], [5], [6], [7]. 

However, when the analysis region contains complicated boundary conditions or small struc¬ 
ture features, the discretization is restricted by the smallest feature of the geometry. Therefore, 
embedding boundary conditions of small structures curtail the advantage of the wavelet based 
techniques, and hence it is one of the critical issues to be addressed. 

One possible way to overcome this difficulty is to combine with other methods. Hybrid con¬ 
nection schemes between the TLM and Finite-Difference Time-domain (FDTD) methods were 
proposed for both the two-dimensional (2D) [8] and the three-dimensional (3D) [9] cases. Since 
Haar MRTD algorithms lead to update schemes that are similar to those of FDTD, the hybrid 
connection with TLM has certain similarities with the TLM-FDTD connection algorithm. How¬ 
ever there are a number of differences due to the different size of the MRTD and TLM cells, and 
due to the difference in time step. 

In this paper, the details of the algorithm will be presented and the results of numerical vali¬ 
dation will be reported. 


II. Connection algorithm 

An algorithm for connecting 2D Haar MRTD for the TE polarization and the 2D TLM shunt 
network will be described. As shown in Fig.l, one MRTD Yee cell and two TLM cells are over- 
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lapped at the interface such that the sampling points of the Haar MRTD cell and the centers of 
the TLM cells are collocated. This alignment allows us to avoid both spatial interpolation and 
extrapolation at the interface. 



• Ey 

-►Hz 

t Hx 

• TLM node center 


■*" iniP4 
i «cV 3 

o MRTD sampling point 


Fig. 1. Connection between the 2D Haar MRTD with the TE polarization and the 2D TLM shunt network. 

The connection algorithm consists of four steps; A-l) at the end of the MRTD region, transfer 
data from TLM node voltages to MRTD E-fields, A-2) at the end of the TLM region, transfer 
data from MRTD E-fields to TLM incident pulses, B) update MRTD H-fields, and perform TLM 
scattering and connection, C-l) at the end of the MRTD region, transfer data from TLM node 
currents to MRTD H-fields, C-2) at the end of the TLM region, transfer data from MRTD H-fields 
to TLM incident pulses, and D) update and excite MRTD E-fields, and perform TLM scattering, 
connection and excitation, then go back to A-1,2) to repeat the cycle. 

The time synchronization of the connection algorithm is depicted in Fig.2. Since the time step 
of Haar MRTD is twice that of TLM, the TLM scattering and connection processes are performed 
twice while MRTD E- and H-fields are updated once. The advantage of combining MRTD with 
TLM is that, in the TLM scheme, both voltages and currents are defined at the same location and 
at the same time. Therefore, field data can be transfered either from TLM to the MRTD E-field 
or from TLM to the MRTD H-field, and vice versa. Because of this advantage, no interpolation 
or extrapolation neither in time nor in space is required. 

III. Formulation 

The formalism will now be described for each step of the connection algorithm. Let us consider 
the interface shown in Fig.3. 

A. At Time Step “n” 

The connection is first made between TLM node voltages and Haar MRTD E-fields. For data 
transfer from TLM to the MRTD E-field, we first sample TLM node voltages at four TLM nodes 
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Fig. 2. Time synchronization between the Haar MRTD and the TLM methods.-: update TLM and MRTD, 

-: transfer from MRTD to TLM,.: transfer from TLM to MRTD. 



Fig. 3. Data transfer between the 2D Haar MRTD and the 2D TLM shunt nodes at the interface. Arrows with 
dotted lines represent the transfer of field data between the two methods. The definition of the incident pulses 
at the TLM branches are shown in the lower left comer. 


that surround the MRTD E v node as 


n r n-1/4 . xi—1/4 , n—1/4 . n—l/4-i 
Vy — 2 l* nc ^l *"h inc v 2 “h inc^3 H - inc^4 j' 


The relation between the TLM node voltage and the Haar MRTD E-field is defined by 




Then those four voltages are decomposed into the Haar wavelet basis by 


- -I 1 1 1 1 1 r ■ 

**E% _ 1 1—1 1-1 ,U E£ 

**E£ _ 4 1 1-1-1 *£% 

J 1 -1 -1 1 J L 


( 1 ) 

( 2 ) 

(3) 
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where ll, lu , ul and uu denote the corresponding Haar MRTD sampling points, for example, lu 
denotes the sampling point at the lower side in the ^-direction and the upper side in the ^-direction. 

For data transfer from the MRTD E-field to TLM, the MRTD E-field at the TLM node centers 
on the last column of the TLM nodes are reconstructed by 


“££ ] 


' 1 1 1 1 


* H>^n 



1-1 1-1 


W>£n 

“ i E" 


1 1-1-1 


rpcpgn 



1-1-1 1 


^Ey 


Since these E-field values are equivalent to TLM node voltages as in (2), the incident pulses at 
the end of the TLM region can be estimated by 


incvT V4 


2v” - + incV% 1/4 + faet#~ 1/4 ) 

2iey - ut»r V4 +inc«r l/4 + incvr 1 '*). 


(5) 


Thus we have all the TLM incident pulses required for the following TLM scattering process. 


B. Transition from V to “n + \/2” 

The fields are now to be updated both in the TLM and Haar MRTD regions individually. For 
H-field update through Haar MRTD, 


^rrn+l/4 
z **-i+l/ 2 ,k 

?»?tT n + 1 / 4 

*"*,*+1/2 


ftrrn-l/4 __ AtM (£m ?n _ £r?z?n 


( 6 ) 

(7) 


for £,t] = <j>, ip, where A t M and A l M denote the Haar MRTD time and space increments, i, k and 
n denote the index in the x ^-directions and time, respectively. 

For TLM, the standard 2D shunt node algorithm is performed, which consists of scattering and 
connection procedures 


r ef[V] n+1/4 = [S] inc[V] n - lIA (8) 

. nc [F]" +1/4 = [C] ref[V] n+1 ^ (9) 

where [S] is the scattering matrix and [C] is the connection matrix. Note that for consistent 
excitation of TLM and MRTD regions, excitation is not occurred at this moment but occurred 
only at the following update step described later. 

C. At Time Step “n -f 1/2” 

After updating H-fields in MRTD, field data are transfered between TLM node currents and 
Haar MRTD H-fields. For data transfer from the TLM node currents to the Haar MRTD H-fields, 
TLM node currents i x are sampled at four TLM nodes that surround the MRTD H z node as 

*r l/2 =4-u»r l/4 -i n c% B+1/4 ] (io) 

A 


1016 


where Z\ is the TLM link-line impedance. Similarly, i z are sampled at four TLM nodes that 
surround the MRTD H x node as 


•n+1/2 _ »+1/4 _ . n+l/4i 

7 Unc^2 tnc v 4 }• 

Then these currents are converted to the MRTD H-field by 

jjn+l/2 ^ _^ n+ 1/2 

ffn+l/2 _ jn+l/2 

Then the H-fields are decomposed into the Haar basis by 


(ii) 


( 12 ) 

(13) 


(14) 


for w — x and y. 

Inversely, data transfer from Haar MRTD H-fields to TLM node currents is achieved first by 
reconstructing F x -fields 


- n+l/2 * 


r i 

i 

i 

i 

1/2 

1 

i 

-i 

i 

-i 

i)4>jjn+\(2 

~ 4 

i 

i 

-i 

-i 

tWffn+ 1/2 


i 

-i 

-i 

i 


* 

Htfn+l/2 -I 


lujjn+ 1/2 


uXjjn+l/2 

. 

imjyn+l/2 


■ llffn+l/2 ~ 


lu jy n+l/2 


uljjn+1/2 

— 

uujjn- f 1/2 
iJ x - 



1111 
1-1 1-1 
1 1-1-1 
1-1-1 1 


-| 

- ‘W’jFfn+l/2 - 


<tnpjpi+l/2 


iptpffn+l/2 

. 

‘tlnpjjn+1/2 


(15) 


and then, with the relation (10),(11),(12) and (13), the incident pulses at the end of the TLM 
region can be estimated by 


inc^A 


- 4 n+ V4 = _ Z[ M + . n .,n + l,4 


' incV2 

= ZiH^ +l/2 + inc !)" +1/4 - 


(16) 


It is then possible to perform the next TLM scattering process. 

D. Transition from “n + \j2” to “n + 1 * 

The field values are now updated again with TLM and a Haar MRTD E-update equation. The 
Haar MRTD E-update equation is given by 

K,k 

Mm 


W 1 = iKk 


+ 


cMm 


2 - ?*$£{% - (it) 


for 7i = where pe* k is the amplitude of an E-field excitation. 

The same TLM algorithm as described above, together with an excitation process at this time, 
is executed as follows: 


re/[VT +1/4 = [5] jnc [vr 1/4 + \e[I] (18) 

where [I] is the identity matrix. This excitation procedure gives smooth excitation distribution 
over the TLM and the Haar MRTD regions. This concludes the connection algorithm, and the 
cycle is repeated. 
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IV. Numerical Validation 


Two cases, a simple rectangular resonator and a square resonator loaded with a perfect con¬ 
ductor fin, were analyzed with the present method. 

A. Simple Rectangular Resonator 

An air filled simple rectangular resonator was first analyzed. Preliminary experiments showed 
that the connection algorithm was stable when the propagation of wave fronts was at 45 degrees 
with respect to the main axes. In order to separate effects due to the symmetry of boundaries 
and effects due to symmetries of the interface topology while maintaining 45 degree oblique prop¬ 
agation, the width and the length of the resonator were taken to be y/2 and \/2/2 as shown in 
Fig.4 (a). The interface region was placed asymmetrically and the TE 12 mode was excited such 
that the field is a superposition of plane waves propagating at 45 degrees with respect to the main 
axes. All the numbers are normalized such that the speed of light is unity. 

In this analysis, the algorithm was stable at time step 10 5 and no instability was observed. The 
E-field distribution at that time step is shown in Fig.4 (b) and exhibits no spurious distortion. 
The Fourier transform of the time signal indicated that the resonant frequency was exactly 1.0 
with an error less than 1.0 x 10“ 5 . This is reasonable because the 2D TLM and Haar MRTD 
methods have no numerical dispersion for the 45 degree diagonal propagation; TLM gives accurate 
results for the 45 degree propagation regardless of the discretization level, and so does the Haar 
MRTD when the stability factor is the maximum allowable value, i.e. s = cAt^/AlM = l/\/2. 



(a) (b) 

Fig. 4. (a) Configuration of the simple resonator. The size of the resonator is V2 x >/2/2. The dotted lines show 
the connection interfaces. The numbers beside the arrows indicate the number of equivalent TLM cells, (b) 
The E-field distribution in the simple resonator. 


B. A Metal Fin Loaded Square Resonator 

The connection algorithm was applied to the analysis of a square resonator loaded with a perfect 
conductor fin (Fig.5). 

In this case the algorithm was stable for 800 time steps, and an abrupt amplification of the field 
values, so-called late-time instability, was observed. The time signal is shown in Fig.6 (a). The 
resonant frequency was 1.09445. However, until the onset of the instability, the field distribution 
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Fig. 5. Configuration of a square resonator and waveguide loaded with a perfect conductor fin. The size of the 
structure is \/2/2 x \/2/2. The numbers beside the arrows indicate the number of equivalent TLM cells. 


remained smooth without spurious distortions even around the edge of the fin (Fig.6 (b)), which 
was difficult to achieve with the conventional boundary implementation in the MRTD method. 




(a) 


(b) 


Fig. 6. (a) The time signal of the square resonator loaded with a perfect conductor fin. (b) The E-field distribution 
in the square resonator loaded with a perfect conductor fin. 


C. A Metal Fin Loaded Waveguide 

The same structure as the previously analyzed fin-loaded square resonator (Fig. 5) was then 
analyzed as a waveguide. The top and the bottom walls were replaced with narrow-band absorbing 
walls; the reflection coefficient for the TLM link lines at those boundaries were set to be -0.1715 to 
realize approximate zero-reflection walls. Due to these narrow-band absorbing walls, the excitation 
signal was also band-limited with the normalized center frequency 1.0. 

Interestingly, as shown in Fig. 7 (a), the algorithm is stable at time step 10 5 and no late-time 
instability is observed. In the analysis of open structures using absorbing walls, numerical errors 
associated with the interface do not accumulate in the analysis region, and the late-time instability 
disappears or delays far behind. The field distribution is also smooth even in the vicinity of the 
metal comers as shown in Fig. 7 (b). 
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Tima 


(a) 


(b) 


Fig. 7. (a) The time signal of the waveguide loaded with a perfect conductor fin for 10 5 time steps, (b) The E-field 
distribution in the waveguide loaded with a perfect conductor fin. The wave travels from right to left. 


V. Conclusions 

A real-time connection algorithm between the TLM and the Haar MRTD methods has been 
presented. The algorithm is stable for the case where a wave propagates at 45 degrees with 
respect to the main axes and is practically stable for open structures such as waveguides loaded 
with metal fins. It is stressed that the algorithm provides smooth field distribution without 
spurious distortions even in the vicinity of metal corners. Since the implementation of boundary 
conditions in the MRTD method is one of the most critical problems, the present combination 
technique is a first step towards overcoming this difficulty. 
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